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These metaphysics of magicians, 

And necromantic books are heavenly; 

Lines, circles, scenes, letters, and characters; 
Aye, these are those that Faustus most desires. 
O, what a world of profit and delight, 

Of power, of honour, of omnipotence, 

Is promis’d to the studious artizan! 
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PREFACE 


The theory of the solid state has increased greatly in complexity during the 
past twenty years, many original papers being incomprehensible except to 
theoretical physicists working in highly specialized fields. As a consequence, 
experimental physicists, metallurgists and others have often to take theoreti¬ 
cal results at their face value, without fully understanding the restrictions 
imposed by the necessary and unavoidable approximations introduced into 
the theory. This sometimes leads to an extrapolation of a theory beyond its 
limits of validity, with results which, if not ludicrous, may draw red herrings 
across the path of scientific progress. 

When he attempts to broaden his theoretical knowledge by reference to 
text-books, the experimental scientist is faced with further difficulty. Books 
on the theory of the solid state generally fall into two classes - elementary 
works which contain little mathematics, and advanced works which pre¬ 
suppose a considerable knowledge both of mathematics and of quantum 
mechanics. Books of the former kind are very useful to those who are 
content to have a superficial knowledge of the theory, but do little to prepare 
the more zealous student for the advanced treatises or for theoretical papers. 
Furthermore, such elementary treatments rarely, if ever, give an adequate 
account of the approximations involved in the theory, which require a 
certain amount of mathematics in their explanation. Indeed, the more 
advanced works are not entirely blameless in this respect, often erecting 
huge structures of formal mathematics upon foundations whose flimsiness 
is, perhaps unwittingly, well concealed. 

The present work is a tentative attempt to bridge the gap between the 
two classes, and at the same time to draw attention to the limitations of the 
various approximate methods in current use. The treatment is mathematical, 
but no more mathematics is required than is normally given in undergraduate 
courses to students of physics, metallurgy and chemistry. Mathematical 
arguments are presented complete with all intermediate steps, and refresher 
accounts of certain mathematical topics are given in appendices. 

I have thought it useful to make the book self-contained, so that it can 
be read without either blind acceptance of ‘well-known’ facts or constant 
reference to other works. Thus, the necessary wave mechanics is developed 
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from first principles, and applications are given to simple one-electron 
systems, atoms and molecules, before dealing with the more complex 
problems of metals and other solids. However, the book does not pretend 
to be a comprehensive survey of the whole of the electron theory of metals - 
the theories of conductivity and ferromagnetism, for example, are not 
treated in detail. The emphasis is upon basic ideas, with the aim of providing 
a sound foundation for further study in more specialized fields. 

Particular attention is paid to the calculation of energies - not only one- 
electron energies but also the energy of the metallic system as a whole. Here 
again, however, the emphasis is upon fundamentals, and the many recent 
techniques used in band structure calculations, which are mainly of interest 
to specialists, are not described. On the other hand, the thorny problem of 
the interaction of electrons, which permeates the whole theory, is discussed 
at length, and there is a chapter on plasma oscillations in metals. Problems 
relating to every chapter are collected together at the end of the book. 

Although the book has been written primarily for graduate students of 
physics, metallurgy and chemistry, who wish to obtain more than a super¬ 
ficial knowledge of the electron theory of metals and of solid state theory in 
general, it will be seen that the greater part of the work is suitable for in¬ 
clusion in undergraduate courses. 

In a book of this sort it would be unnecessarily cumbersome always to 
give references to original papers. This has been done, in fact, only in the 
case of the more recent papers of particular importance and those which 
contain the quoted results of experiment or lengthy calculation. When 
further references are required, they may be found in the works fisted in the 
Bibliography. 

The book has not been read by experts, or even discussed with them, 
before publication, so that any sins of omission or commission are my sole 
responsibility. Since many of the topics covered have not appeared in text¬ 
books before and the approach to others is somewhat unconventional, 
it is too much to hope that errors are not present. I shall be grateful to those 
friends who will point them out to me. 

Finally, my best thanks are due to Mr. J. H. Tripp for his help in reading 
the proofs. 

S. RAIMES 

Imperial College, 

September, 1961 
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CHAPTER 1 


INTRODUCTION 


1.1. Classical Mechanics and Quantum Mechanics 

All bulk matter, whether solid, liquid, or gas, is composed of atoms. 
It is now well known that an atom is not, as was once thought, a particle 
incapable of further subdivision, but is a complex dynamical system, 
consisting of negatively charged particles called electrons and a positively 
charged particle called the nucleus. Indeed, the nucleus of an atom is itself 
an extremely complex structure, consisting essentially of positively charged 
protons and uncharged neutrons. Fortunately, however, as far as the theory 
of solids is concerned, it is generally sufficient to regard a nucleus as a 
particle of negligible size, having a certain positive electric charge and a 
certain mass, these varying from element to element. 

The properties of bulk matter must, in principle at least, be deducible 
from the properties of the electrons and atomic nuclei of which it is com¬ 
posed. It is found, however, that the observed properties of matter cannot 
be explained on the assumption that these particles obey the laws of classical 
mechanics, as formulated by Newton. A new mechanics, called Quantum 
Mechanics, has had to be constructed, and one particular form of this is 
called Wave Mechanics. In this book we shall start with the principles of 
wave mechanics and show how to apply them first to the simplest atomic 
systems and later to the large and important class of solids known as metals. 

It must be understood that these principles of wave mechanics cannot 
be deduced from classical mechanics. There are, however, several alter¬ 
native sets of principles, any one of which may be deduced from any other, 
so that the set which is chosen as fundamental is largely a matter of con¬ 
venience. Although this applies equally well to the principles of classical 
mechanics, it is a fact which is often overlooked, owing to the apparently 
‘self-evident* nature of classical mechanics. For this reason it may be useful 
to digress briefly on the philosophy of physical theory. 

The purpose of theoretical physics is to co-ordinate the known facts 
of experimental physics, and to predict new ones. A physical theory ‘ex¬ 
plains* an experimental result if it shows that this result is consistent with 
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other experimental results, which it will do if all the results can be deduced 
from the basic principles of the theory. As soon as an experimental fact t 
appears which the theory cannot explain, the theory must be amended. 
A new theory is set up, or a new principle is established, which accords with 
the new and unexplained fact. Of course, the new theory must explain all 
the other experimental facts as well, or it is no improvement on the old 
theory. It is important to notice, however, that the basic principle of the 
new theory has no ‘explanation’, except that deductions from it are in 
agreement with the experimental facts. 

We have, of course, been discussing an idealized situation. Although it 
is theoretically possible to deduce the result of any experiment from a 
given set of principles, in practice this may be impossible to do accurately, 
owing to mathematical difficulties. We shall have many examples of this 
later in the book. Thus it may not be possible always to verify a theory in a 
completely satisfactory way. None the less, the theory may be accepted 
so long as the approximate deductions from it are in reasonable agreement 
with experiment, and no well-established deduction is completely at variance 
with any experimental fact. 

When we say that a new basic principle itself has no explanation, we 
do not wish to imply that it never will have. A principle established at one 
level of experience may have an explanation at a more fundamental level — 
a macroscopic principle, for example, may have an explanation in terms of 
atomic motions. However, as we have remarked, at any particular level 
there may be several alternative sets of principles which lead to the same 
results. So long as these sets of principles are consistent with each other, 
they are all equally valid, and it does not matter which is taken to be fun¬ 
damental. Admittedly, it is aesthetically satisfying to have the basic principles 
as simple and as few in number as possible, but from a practical point of 
view this is not important — the most compact set of principles may not 
be the most tractable. The principles we shall adopt in this work are neither 
the most compact, nor the most elegant, but are in a form which leads most 
easily to the results we require. 

It may be reassuring to realize that everything we have said is just as 


We use the term ‘experimental fact’ to denote a well-founded experimental 
result one which is reproducible; that is, one which can be obtained again and again 
by repeating a given experiment. This is to overcome the difficulty that the result of a 

single experiment may conflict with theory simply owing to some chance experimental 
error. 
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true of classical mechanics as it is of quantum mechanics. The most elemen¬ 
tary formulation of the principles of classical mechanics is contained in 
Newton’s equations of motion — and these are not self-evident, but survive 
only because they correctly predict the motion of macroscopic bodies, such 
as billiard balls, ballistic missiles, and planets. However, one could equally 
well take Lagrange’s equations, Hamilton’s equations, or Hamilton’s 
principle as the fundamental principles of mechanics, since these are all 
consistent with each other and with Newton’s equations. It is true that 
Newton’s equations are the simplest, but, for this very reason, they are 
suitable only for simple problems; Lagrange’s and Hamilton’s equations 
are more useful for complicated dynamical problems. 

The situation in quantum mechanics is rather worse. Not only are there 
many alternative sets of initial postulates, but there are two mathematical 
formalisms so different that at first sight they appear to be different theories. 
First, there is matrix mechanics, due to Heisenberg, Born and Jordan, 
and, second, there is wave mechanics, due to de Broglie and Schrodinger. 
The elegant generalization of matrix mechanics, due to Dirac, should also 
be mentioned. Although matrix mechanics and wave mechanics appear to 
have little in common, it can be shown that they are, in fact, equivalent, 
so that we may use whichever is more convenient for a given problem. In 
this book we shall use only wave mechanics, which is in many ways simpler 
and makes a more direct appeal to physical intuition, although it may still 
be worth while to repeat Eddington’s t warning that the popularity of wave 
mechanics may be due to its being ‘simple enough to be misunderstood’. 


1.2. Basic Experimental Facts 

What, then, are the basic experimental facts, which are inexplicable 
by classical mechanics, and which therefore require a new theory? There is, 
indeed, a very wide choice of these, since almost the whole of atomic 
physics, and hence the greater part of modern physics, is quite inexplicable 
by classical mechanics. In order to avoid mathematical difficulties, however, 
it is best to start with the simplest and most clear-cut experimental results, 
and hope that a theory constructed to fit these will also explain the others. 
We shall begin by considering only two experimental facts: (i) the wave¬ 
like behaviour of particles, and (ii) the existence of line spectra. 


t The Nature of the Physical World , Chap. 10. 
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1.2.1. WAVES AND PARTICLES 

In 1900, Planck proposed, in order to explain the distribution of energy 
in the spectrum of a black body, that an oscillating electron may not radiate 
or absorb energy continuously, as required by classical electrodynamics, 
but only in integral multiples of a fundamental unit called a quantum. If 
v is the frequency of the radiation, the energy of a quantum is given by 

£ = /*v, (1.1) 


where h is a constant of nature, known as Planck's constant, which has the 
value 6.624xlO -27 erg sec. In 1905, Einstein went further and suggested 
that light, or electromagnetic radiation in general, might sometimes be 
regarded as having a corpuscular, or particle-like, nature. The light-particles, 
or quanta, are called photons, and have energy given by (1.1). 

Although light had been ascribed a corpuscular nature by Newton, 
this concept had been shelved owing to the many successes of the wave 
theory. Einstein’s proposal explained very well the photo-electric effect — 
when a metallic surface is irradiated with ultra-violet light, electrons are 
emitted with energies which do not depend upon the intensity of the light, 
but only upon the frequency, in a way which suggests that the light con¬ 
sists of particles each of energy hv. 

According to the theory of relativity, the momentum of a photon must 
have magnitude p, given by 


P 



( 1 . 2 ) 


where c is the speed of light and A is the wavelength. This follows immediately 
from the equation which is nowadays popularly known as ‘the equation 
of atomic energy’, that is, 


E = me 2 , (1.3) 

for the energy E of a particle with relativistic mass m. If the speed of the 
particle is v, the magnitude of its momentum is 


using equations (1.1) and (1.3). If we let v = c, the speed of a photon, 
we obtain equation (1.2). This is the only appearance the theory of relativity 
will make in the present book. In future the mass in will be assumed to be 
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independent of velocity; that is, it will denote the rest mass of the particle. 
Photons are peculiar in that they travel with the speed of light and have 
zero rest mass, but we shall not be directly concerned with photons in the 
following work. 

What it is important to notice is that light can be thought of either as 
waves with particle-like properties or as particles with wave-like properties. 
There is no need for us to worry about how a thing can be both a particle 
and a wave; we merely accept that the nature of a given experiment will 
lead to results which emphasize one aspect or the other, and the relations 
between the two aspects are those given in equations (1.1) and (1.2). 

In 1924, de Broglie generalized this idea, and suggested that any moving 
particle, with mass m and speed v, will in some experiments display wave¬ 
like properties, with wavelength 

. h h , 

2- > (1.5) 

p mv 

as in equation (1.2). This theoretical suggestion received experimental 
confirmation in 1927, when Davisson and Germer observed the diffraction 
of a beam of electrons by a crystal of nickel. Electrons emitted from a hot 
filament and accelerated through an electric field to a known velocity were 
allowed to fall upon the surface of a nickel crystal. The reflected beam was 
found to exhibit directed maxima and minima of intensity, as in the case of 
X-ray diffraction, which could be accounted for by the assumption that 
the electron beam consisted of waves with wavelength given by (1.5). Similar 
results have since been obtained with beams of other kinds of particles, 
including atoms and molecules. 

1.2:2. LINE SPECTRA AND STATIONARY STATES 

According to classical electrodynamics, an accelerated electric charge 
will emit radiation. If we accept Rutherford’s model of a hydrogen atom, 
for instance, as an electron moving in an orbit about a nucleus consisting 
of a single proton, we should expect a continuous emission of radiation, 
and hence a continuous loss of energy by the electron. This would result 
in the orbit getting smaller and smaller until eventually the electron would 
collide with the proton. It need hardly be said that such a catastrophe is 
quite contrary to observation. 

Neglecting this difficulty, however, another presents itself. According 
to classical mechanics, a dynamical system can absorb or emit infinitesimal 
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amounts of energy. If the energy of a system, consisting of particles in 
motion, is increased by a small amount, the velocities and orbits of the 
particles will simply be changed by small amounts. This means that a 
classical system could exist with any one of a continuous range of energies. 
If this were true of atomic systems, we should expect a gas of hydrogen 
atoms, for example, to absorb and emit light of all frequencies, that is, we 
should expect a continuous spectrum. In fact, the low frequency end of the 
hydrogen spectrum consists of an infinite series of discrete spectral lines. 
Niels Bohr, in 1913, deduced from this, together with Einstein’s photon 
hypothesis, that an atom can exist in any one of a set of so-called stationary 
states, each with a definite energy, and with a finite energy difference between 
one and the next. Also, when an atom is in a stationary state it does not 
radiate, but radiation of frequency v is emitted when an atom passes from 
a stationary state with energy E 1 to a stationary state with lower energy 
E 2 , where 

E i E 2 = hv. (1.6) 

This is called the Bohr frequency rule. Clearly, if the atom is in the lower 
energy state, the absorption of a quantum hv will transfer the atom to the 
higher energy state. 

In this book we shall be mainly concerned with calculating the energies 
of stationary states, or energy levels, of atoms and systems of atoms. We 
shall be particularly interested in the stationary state with lowest energy, 

or ground state, which is the most stable state of an atomic system_the 

other states are referred to as excited states. Of course, we should mention 
that, as well as the discrete energy levels, there is generally at higher energies 
a continuum of energy levels, which gives rise to a continuous spectrum. 
In the hydrogen atom, for instance, this continuous spectrum corresponds 
to complete ionization in other words, the case when the electron and 

proton are no longer bound together, but have become separate free 
particles. 

Bohr himself proposed a theory of atoms, based upon the classical model 
of planetary electrons circulating about the nucleus, but having the idea 
of stationary states grafted on to it in a rather artificial way. This theory 
accounted very well for the line spectrum of hydrogen, but could not be 
applied directly to more complicated atoms. Thus, although it had several 
partial successes apart from the hydrogen atom, the utility of the theory 
was strictly limited, and it soon gave way to quantum mechanics, which 
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represented a more radical, but at the same time more satisfying, break 
with classical mechanics. We will therefore omit details of the Bohr theory, 
and proceed directly to a discussion of wave mechanics as applied to the 
simplest of all dynamical systems — a single particle constrained to move 
in a straight line. 

1.3. Schrodinger’s Equation for the One-Dimensional Motion of a Single 


Particle 


Let us consider a particle of mass m moving along a line, which we 
may take to be the x-axis, under the action of a force F in the positive x- 
direction. Also let us suppose that the particle has a potential energy 
V(x), so that, according to classical mechanics, F = — dF/dx. Such a 
system, with a potential energy which does not depend explicitly upon the 
time, is called a conservative system, because the total energy is conserved, 
that is, remains constant. In this book, except for the final chapter, we 
shall be concerned exclusively with conservative systems. 

We now postulate, and this will be justified by results, that the state of 
motion of the particle is in some way represented by a function \{/(x) which 
satisfies the equation 



(1.7) 


Here h = h/2n, and E is the constant total energy of the particle, that is, 
the sum of its kinetic and potential energies. Equation (1.7) is called Schro¬ 
dinger’s Equation, or Schrodinger’s Wave Equation, for this system, and 
the function \j/ is called a wave function, for reasons which will become 
apparent. We will leave aside the physical interpretation of \p for the moment 
— and indeed one can get a very long way in the theory without interpreting 
i \) at all — but we will concentrate first upon showing that the Schrodinger 
equation can provide the sort of result that is required to explain the facts 
described in § 1.2. 

In the first place, if the potential energy V is constant, then the kinetic 
energy T is also constant. We have 


T = \mv 2 = E— V, 


( 1 . 8 ) 


where v = x. Thus, if X is the de Broglie wavelength, defined in equation 
(1.5), we find 
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(1.9) 


ana equation (1.7) becomes 



( 1 . 10 ) 


with A constant. 


The general solution of this equation may be written 


= A sin 



( 1 . 11 ) 


where A and 8 are arbitrary constants, known as the amplitude and phase 
respectively. If A and <5 are 
real constants, the graph of 
is thus the sinusoidal wave 



profile, of wavelength A, shown 
in Figure 1.1. The constants 
A and <5 may, of course, be 
complex numbers, in which 
case it is preferable to write 
(1.11) in the alternative form 


Fig. 1.1. Graph of the function 
y> = A sin [{2jijX)x+8]. 


i f/(x) = Be i2nxlx + Ce~ i2nx/X . 


( 1 . 12 ) 


Both the real and imaginary parts of 1 J/ are sinusoidal functions like 
that shown in Figure 1.1. 

We have used the term ‘wave profile’ rather than ‘wave’ because the latter 
term generally denotes a moving wave profile, or a disturbance which is peri¬ 
odic m time as well as in space. Indeed, another equation can be set up, called 
the time-dependent Schrodinger equation, which, for stationary states, has 
solutions of the same spatial form as those of equation (1.7), but multiplied 
by a periodic function of the time. We shall discuss this time-dependent 
equation in Chapter 11, and shall there see that, so long as we concern 
ourselves only with stationary states, and not with the mechanism of tran¬ 
sitions between one stationary state and another, the time-dependence of the 
wave functions has no significance. Except in that final chapter, we shall, in 
fact, confine our discussion to stationary states, and so may ignore the 


time-dependence. 
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What we have shown is that the wave function of a free particle, that is, 
a particle acted upon by no forces (d V/dx = 0), has a wavelength which is 
given by the de Broglie relation (1.5), as required. Furthermore, if in some 
region of the x-axis F(x) varies only very slightly with x, equation (1.10) 
will hold approximately in this region and if/ will be very nearly a de Broglie 
wave profile. Of course, equation (1.7) goes beyond this: it offers a prescrip¬ 
tion for obtaining i p even when the de Broglie wavelength can have no 
meaning, owing to the rapid variation of V with x. Although this generaliza¬ 
tion is a natural one, its only justification will lie in the results which can 
be derived from it and compared with experiment. The results in which 
we shall principally be interested are the energy levels of the system; and 
this brings us to our second point — that discussed in § 1.2.2. 

We deduce from the foregoing work that a free particle can have any 
energy. Thus, whatever the value of £, there is a corresponding value of A 
and a wave function given by equation (1.11). No question of discrete 
energy levels arises in this case. However, this is not so in the case of a 
bound particle, that is, a particle acted upon by a force which would, ac¬ 
cording to classical mechanics, confine the particle to a finite region of 
the x-axis. Such a situation is shown in Figure 1.2. According to classical 
mechanics, a particle with total energy E and potential energy F(x), as 
shown, would be confined to the region a ^ x ^ b, since outside this 
region the kinetic energy, T = E—V, would be negative. On reaching 
x = b, therefore, the particle would be reflected and would travel to x = a, 
where it would again be reflected, and so on. However, classical mechanics 
permits the energy E to have any value lying 
above the bottom of the potential trough. 

Wave mechanics presents quite a different 
picture: we shall see below that the impo¬ 
sition of certain very natural conditions 
upon the wave function 1 1/ results in a set of 
discrete energy levels for bound states, and 
energy values lying in between two adjacent 
levels are not permitted. 

The conditions we impose upon i j/ are 
simply that it be single-valued , continuous, pig. j 2 . 

smooth [except at infinities of F(x)], and 

finite everywhere. The first three of these, in fact, are conditions we should 
normally take for granted in solving a differential equation of the type 
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of equation (1.7). If V(x) becomes infinite at any point, d 2 \J//dx 2 and hence 
the curvature of i f/ become infinite, so that \{/ is not expected to be smooth 
at that point, but to show a sharp corner there. The last condition, that 
^ be finite everywhere, is related to a fifth condition which we shall later 
discuss in connection with the physical interpretation of the wave function, 
but which we do not require for the moment. Perhaps it should be 
remarked in advance that, although \j/ = 0 is always a solution of equation 
(1.7), it is not permitted as a wave function. 


1.4. Particle in a Rectangular Potential Well 

Let us suppose that the potential energy V(x ) of a particle of mass m 
has the constant positive value V 0 when x < 0 (region I) and when x > L 
(region II), and is zero when 0 ^ x ^ 

We wish to find the energy levels 
of the bound states, that is, states 
with total energy E less than V 0 — 
the total energy is necessarily posi¬ 
tive, of course, since V ^ 0. We 
therefore have to find the values of 
E < V 0 for which the Schrodinger 
equation (1.7) has solutions satisfying 
the conditions stated above. Let us 
first consider the three regions separately. 


In region I, V(x) = V 0 , and equation (1.7) becomes 


0 + ^( £ _ w =o. 

(1.13) 

The general solution of this is 


•A = ^ 1 e ax + B 1 e““*, 

(1.14) 

where A x and B x are arbitrary constants, and 



(1.15) 


a is real, since E < V 0 , and we will take the positive value of the square 
root. 

In region II the equation is the same, and we have 


L (region III), as shown in Figure 1.3. 


V(x) 

1/ 


© 

v o 

® 

® 


O Lx 

Fig. 1.3. 
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\j/ = A 2 e ax + B 2 e~ ax , 
with a again given by (1.15). 

In region III, however, V(x) = 0, and the equation becomes 

dV 2 mE , _ 

d? + ”fT ^ = °’ 

with the general solution 

ij/ = A 3 cosfix + B 3 sin fix, (1.18) 

where A 3 and B 3 are arbitrary constants, and 

P = s/(2mE/h 2 ). (1.19) 

We note that the second term of the expression (1.14), i^e - ®*, since a 
is positive, may be made indefinitely large by taking a sufficiently large 
negative value of x. This violates the last condition on i p, which demands 
that \j/ be finite everywhere. We must therefore set B x = 0. 

The expression (1.14) is only valid for x < 0, so we are not concerned 
about what happens to the first term when x becomes large and positive. 
In the case of expression (1.16), however, which is valid for x > L, it is 
the first term which causes difficulty, for A 2 e ax may be made indefinitely 
large by taking x sufficiently large and positive. We must therefore set 
A 2 = 0. 

The three forms of the solution are, thus, 

4/ = A t e ax , x < 0, (1.20) 

\J/ = A 3 cos fix + B 3 sin fix, 0 ^ x ^ L, (1.21) 

4 / = B 2 e~ ax , x > L. (1.22) 

Now the problem is to determine the values of the constants so that 

4/ is continuous and smooth everywhere. To do this we simply have to 
ensure that 4 > and di/f/dx are continuous at x = 0 and at x = L — in other 
words, that the solutions in adjacent regions give the same values of these 
quantities at the boundaries. 

The continuity of 4 / at x = 0 gives 

A i = A 3 , (1.23) 

and that of di/^/dx gives 

ocAi = f$B 3 . ( 1 . 24 ) 


(1.16) 

(1.17) 
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Similarly, the continuity of i j/ at x = L gives 

A 3 cos JSL + B 3 sin j3L = B 2 e~ aL , 

and that of dip/dx gives 

—/L4 3 sin pL+fiB 3 cos fiL = — xB 2 e~ aL . 

Dividing (1.26) by (1.25), we obtain 

A 3 sin fiL — B 3 cos /?L _ a 
A 3 cos PL + B 3 sin j3L ft 

After rearrangement, this yields 


tan pL = ~ , 
p 2 -* 1 


or, from (1.15) and (1.19), 


tan 



2 Ve(F 0 -E) 
2 E-V 0 


(1.25) 

(1.26) 


(1.27) 


(1.28) 


This equation must be solved numerically for E, given the values of V 0 
and L. It is immediately apparent that the equation may be satisfied by 
one or more discrete values of E, but not by a continuous range of energies. 
We thus see that the use of the Schrodinger equation, together with the 
conditions on ij/, leads to a set of discrete energy levels for bound states. 

We shall not proceed further with the solution of (1.28), because it will 
be sufficient for our future purposes to consider the idealized case in which 
V 0 tends to infinity — the problem is then generally referred to as that 
of ‘a particle in a one-dimensional box with perfectly reflecting walls’. In 
this case it is quite easy to obtain an analytical expression for E. Indeed 
this may be obtained immediately from (1.28), by setting the right-hand 
side equal to zero, but it is instructive to work from first principles. 


1.5. Particle in a One-Dimensional Box 

If, in the foregoing problem, we let V 0 tend to infinity, then a tends 
to infinity also, and the solutions in regions I and II, given by (1.20) and 
(1.22), both tend to zero. Thus, in the limit, the problem is to determine 
.he constants in the solution (1.21), for the region III, so that this solution 
is zero at x = 0 and at x = L. 
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— 0 gives us A 3 =0, and = 0 gives us 


that is, 


B 3 sin /?L = 0, 


P = nn/L, n = ± 1, ±2, +3,. . 


excluding the possibilities B 3 = 0 and /? = 0, which would give the disal¬ 
lowed zero solutions. 

Since /? = 2 ), the allowed energy levels are given by 


E 


n 


ftW 

2m L 2 


n = 1, 2, 3, . . 


(1.29) 


The corresponding wave functions, in region III, are given by 


ip n = B n sin 


MIX 


n = 1, 2, 3,..., 


(1.30) 


where B n is so far an arbitrary non-zero constant. The negative integral 
values of n merely repeat the same energies and wave functions, and hence 



Fig. 1.4. Wave functions for a particle 
in a one-dimensional box ( V 0 infinite). 


The wave functions for the first three values of n are shown in Figure 1.4, 
arbitrarily taking B n to be the same real constant in each case. It should 
be noticed that, apart from the zeros at a = 0 and at x = L, which do 
not occur when V 0 is finite, i j/ n has («— 1) zeros or nodes — the ground 
state i pi has none, the first excited state i J/ 2 has one, and so on. This is 
generally the case, and provides a rough method of estimating which 
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of two wave functions corresponds to the lower energy, when the energy 
levels are not immediately available. 

The wave functions are all zero outside region III, and it will be seen 
that there is a discontinuity of slope at x = 0 and at x — L. This causes 
no difficulty, however, when it is remembered that it is the result of a 
limiting process. For any finite value of V 0 , however large, there would 
be no discontinuity of slope, but a smooth exponential decrease in regions I 
and II — the wave function for the state of lowest energy has the form 
shown in Figure 1.5 in this case. As V 0 increases, the exponential decrease 



Fig. 1.5. Wave function for the ground state 
of a particle in a rectangular potential well 
(K 0 finite). 

becomes more and more rapid, until, in the limit of infinite V 0 , we get 
the situation shown in Figure 1.4. 

The energy levels E n and the wave functions \J/ n of the stationary states 
are frequently referred to as the eigenvalues and eigenfunctions , respectively, 
of the Schrodinger equation. 

The energy levels of the simple system we have been discussing cannot 
immediately be compared with experiment, but we shall see in Chapter 7 
that a generalization of this model to three dimensions gives results which 
are in remarkable agreement with experiment. The wave function if/ has 
so far been merely a tool for obtaining the energy levels. While this is 
undoubtedly its principal purpose, it has also proved possible to interpret 
if/ physically in a way which leads to useful results. 

1.6. Interpretation of the Wave Function 

The wave function if/ is generally a complex function, that is, it may be 
written in the form 


•AO) = /(-v) + ig(A'), 


( 1 . 31 ) 
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where/and g are real functions of x , and i = V — 1 . The complex conjugate 
of ip, which we denote by ip*, is obtained by changing i to -i wherever 
it appears in ip. Thus, 

•A* =/~i£. (1.32) 

The product of ip and ip* is real and positive, and its positive square 
root, denoted by \ip\, is called the modulus of i p. Thus, 


\P*iP = \ip\ 2 =f 2 +g 2 . (1.33) 

Now, a further condition upon ip, which we have not yet stated, but 
whose significance will shortly become apparent, is that the integral of 
\ip\ 2 over all x must be a finite number and not zero, that is 


(*QO 

I \ip\ 2 dx = finite number. (1-34) 

' - oo 


This integral could only be zero, in fact, if ip were identically zero, which 
we have already said is not permitted for a wave function. 

Schrodinger’s equation is linear, so that, if ip is a solution, so also is 
Ctp, where C is any constant. This may easily be verified by substitution. 
The wave function is, in fact, always arbitrary to the extent of a multiplying 
constant, and it follows that we can always choose ip so that 

f” = 1. (1-35). 


The function \j/ which satisfies this equation is said to be normalized ^. 

Since ip is generally complex it is clear that \j/ itself cannot have a direct 
physical meaning. However, it has been suggested, with successful results, 
that, when ip is normalized, |i^| 2 may be interpreted as a probability density , 
such that 

\i//\ 2 dx is the probability that the particle be found in the small interval 
d„r at position x. 

More explicitly, if a very large number of observations of the position 


f It should be mentioned that, although there are no difficulties about the normalization 
of the wave functions of bound states, difficulties do occur in the 'case of free particles. 
The energy levels of the former belong to the discrete spectrum, while those of the latter 
belong to the continuous spectrum, and in this case equation (1.34) is not satisfied. This 
difficulty with free particles is simply overcome by doing away with free particles! In 
other words, every system is considered to be enclosed in a box — clearly, if this box is 
very large compared with the system under consideration, its presence cannot affect the 
predicted results of experiments on the system, but the limits it imposes upon the range 
of any particle gets over the mathematical difficulties of normalization. 
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of a particle in the state t j/ were made at random times, in a fraction \\J/\ 2 dx 
of them the particle would be found in dx. 

It is essential for this interpretation that i p be normalized, for the particle 
must be somewhere, so that the sum of the probabilities of its being in every 
interval from — oo to +oo must be unity, that is, 



2 dx = 1. 


It also follows from this interpretation that i j/ cannot be identically zero. 

As an example, let us consider the particle in the one-dimensional box. 
We have seen that the wave functions are zero outside the box, and inside 
it are given by 


, „ . nnx 

\K = B n sm —, 

Xj 


n = 1, 2, 3, . . .. 


In order to normalize \J/„ we must choose B n so that 


C L . 2 nnx , 

B„ sin -dx = 1, 

vo L 


and this gives 

B n = V2/Z, 


(1.36) 

(1.37) 


which is independent of n. The normalized wave functions are, therefore, 
given by 


•An = 



n = 1, 2, 3,. . ., 


(1.38) 


inside the box. 

Here we have assumed that B„ is real, but this is not necessary — we 
could multiply the value of B n given in (1.37) by e' 6 , where b is any real 
number, and the wave functions would still be normalized, because 
|e ,& | = 1. In other words, even a normalized wave function is arbitrary 
to the extent of a multiplying constant with modulus unity. This again 
emphasizes the fact that the wave function itself has no direct physical 
meaning — the value of |<A| 2 , which we have interpreted physically, is 
independent of any factor e ,ft in t j/. It is due to this fact that the time- 
dependence of stationary states is of no significance, for we shall see in 
Chapter 11 that the time t only appears in the wave function of a stationary 
state in a factor of the form e ,6 ‘, which has modulus unity. 
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The graph of , the probability distribution for the ground state of 
the particle in the box, is shown in Figure 1.6. This is quite a different 
picture from that presented by classical mechanics. According to the latter, 
a particle in such a box would travel with 
uniform velocity from wall to wall, and at 
the walls it would be perfectly reflected — 
there would consequently be the same 
probability of finding it anywhere in the box, 
and its probability distribution would be a 
straight line at height 1/L above the axis, 
as shown. According to wave mechanics, Fig . L6 . Probability distribution 
however, there is a much greater probability for the ground state of a particle 

of finding the particle in the middle of the in a box - The horizontal line at 

. n height 1/L is the probability 

box than near the walls. .. . , 

distribution according to classical 

The discrepancy between classical and mechanics, 

quantum mechanics is even more pronounced 

in the case of finite potential walls. We have seen that here the wave 
functions do not become zero at the walls, but fall exponentially to zero 
in the regions beyond the walls — the wave function for the ground state 
is shown in Figure 1.5, and the probability distribution has a similar form. 
This means that, according to wave mechanics, when the potential energy 
V 0 is finite, there is a non-zero probability of finding the particle outside 
the box. Such a thing is quite impossible in classical mechanics, according 
to which a particle whose energy is less than V 0 could never pass beyond 
the walls — in order to do so its kinetic energy would have to become 
negative, which is forbidden. The passage of a particle through a potential 
barrier, which would be impenetrable according to classical mechanics, is 
known as the ‘tunnel effect’, and is important, for instance, in the theory 
of radio-active decay. 


1.7. Orthogonality of Wave Functions 


If i j/ x and \j/ 2 are wave functions corresponding to the different energy 
levels E t and E 2 respectively, then 



* ~ 00 


2 


dx 


= 0 , 


(1.39) 


and the two functions are said to be orthogonal. Alternatively, taking the 
complex conjugate of both sides of equation (1.39), 
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(1.40) 


To prove this, let us first note that the Schrodinger equation for \j/*, 
the complex conjugate of 1 {/, is the same as that for i J/ itself, so long as the 
potential energy V(x) is real; thus, 


dV* 

dx 2 



[£-F(x)]<A* = 0. 


(1.41) 


This can easily be seen by noting that, if i f/ =f+ig, where / and g are real 
functions of x, then both / and g separately must satisfy equation (1.7), 
and it follows that any linear combination of the form Af+Bg, where A 
and B are arbitrary constants, must also satisfy it — in particular, the linear 
combination i j/* = f—ig must satisfy it. 
i/f* and \j/ 2 therefore satisfy the equations 

TT + (!•«) 

dx n 

+ 2 -£lE 1 -V(xW 1 =0. (1.43) 

dx n 


Now, if we multiply the first equation by t f/ 2 and subtract from it the second 
equation multiplied by i^*, we find 


•A: 


d 2 -'-* 


Vi 


dx 2 


- ** Pj + ^(£,-£ 2 K* 2 = 0. 

dx n 


(1.44) 


Integrating over all x gives 




dx 


dx _ 



(1.45) 


since 



In order that equation (1.34) be satisfied, \j/* and \f/ 2 must tend to zero 
as x tends to oo or to — oo, and the left-hand side of equation (1.45) there¬ 
fore vanishes. Also, we have postulated that E l and E 2 are different energy 
levels, so that (E 2 —E 1 ) is not zero, and equation (1.39) follows. 

In some problems it may not be necessary to consider the whole range 
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of x from — oo to +oo. For instance, if t ]/* and i p 2 were both zero at 
x = a and at x = b, then, integrating from x = a to x = b instead of 
over all a, we would find, in the same way, 

rb 

\]/*\J/ 2 < ix = 0. (1.46) 


£ 


In this case the functions are said to be orthogonal in the interval (a, b). 
Similarly, if 



(1.47) 


the function if/ is said to be normalized in the interval (a, b). 

A set of functions i]/ n , where n = 1, 2, 3,. . ., which are all normalized 
and mutually orthogonal in an interval (a, b), are said to form an orthonormal 
set in this interval. As an example, we may take the normalized wave 
functions of a particle in a potential box: 

. 1/2 . nnx . „ . 

<A„ = Y - sin — , n = 1, 2, 3, . . .. 

Ld Li 


We have shown that these functions are normalized in the interval (0, L ). 
Also, if m and n are different integers, 



mnx . nnx , 

-sin-dx 

L L 


2Jo L 


(m — n)nx (m + n)nx 

cos ^---cos -- 


J dx = 0. 


(1.48) 


Hence, the functions ip„ are mutually orthogonal in the interval (0, L), 
and so form an orthonormal set in this interval. The functions are real in 
this case, so that ip* is the same as i f/„. Since the wave functions are zero 
outside the interval (0, L), they are also orthonormal in the whole range of 
x, that is, the interval ( — oo, oo). Further properties of orthonormal sets 
are discussed in Appendix 2. 


1.8. The Harmonic Oscillator 

Before leaving the one-dimensional Schrodinger equation it may be in¬ 
structive to consider a final example, which has many applications throughout 
the whole of atomic physics — the linear harmonic oscillator. According 
to classical mechanics this is defined as a particle moving in a line under 
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the action of a force directed towards a fixed point in the line and of magni¬ 
tude proportional to the distance of the particle from this point. Let us 
take the fixed point to be the origin, the line to be the x-axis, and the force 
to be — kx, where k is positive. Then, if m is the mass of the particle, its 
Newtonian equation of motion is 

m = — kx, (1-49) 

d t 2 

which has the general solution 

x = A sin (cot + <5), (1.50) 

where co = ^(k/m), and A and <5 are arbitrary constants, which may be 
fixed by the initial conditions. The particle 
therefore performs simple harmonic oscillations 
with frequency o)/2n, and co is called the 
angular frequency of oscillation. 

The potential energy V(x) is obtained from 
the equation 

= kx = mco 2 x. (1.51) 

dx 

This gives 

V(x) = \moo 2 x 2 , (1-52) 

if the zero of potential energy, which is arbi¬ 
trary, is taken to be at x = 0. This function is shown in Figure 1.7. 

Passing now to wave mechanics, the Schrodinger equation, obtained 
by substituting (1.52) in (1.7), is 

+ 7r( £ -i wc ° 2 x 2 >A = 0. (1.53) 

dx n 

Let us consider the function 

<Ao = e" 



Fig. 1.7. Potential energy 
V(x) — ima> 2 x 2 and energy 
levels E n = (n-\-l)hco for the 
harmonic oscillator. 


where oc = mco/lh. We have 


(1.54) 
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and 

— ^° = —2oce~ ax2 + 4(x 2 x 2 e~' zx2 
dx 2 

2m 

= -- (^h(jo-%mco 2 x 2 )il/ 0 . (1.55) 

h 

Equation (1.55) is the same as (1.53), provided E = \h(n. Also, the function 
i/'o satisfies all the conditions for a wave function. We have therefore shown 
that ij/ 0 is an eigenfunction of the Schrodinger equation corresponding 
to the eigenvalue 

E 0 = }hco. (1.56) 

What we have not shown, but what is none the less true, is that \J/ 0 is the 
ground state of the oscillator, that is, E 0 is the lowest energy level. We might 
have guessed this, however, for \p 0 has no nodes, and we have stated 
previously that this is a characteristic of the ground state wave function. 

Following the same argument, we expect the state of next higher energy, 
or first excited state , to have a single node, which, by symmetry, must 
occur at the origin. Now the function 

•h = xe~ xx2 k (1.57) 

has a node at the origin, and satisfies all the conditions for a wave function. 
Also 

= — 6ax e -a * 2 + 4a 2 x 3 e ““* 2 (1.58) 

dx 2 

= — ^ (flico — ^m<x> 2 x 2 )\j/ l , (1-59) 

h 2 

which is the same as equation (1.53), provided E = f/ico. Thus, is an 
eigenfunction of the Schrodinger equation corresponding to the eigenvalue 

E x = pco. (1.60) 

The reader may verify that the second excited state, with two nodes, is 

= (l — 4<xx 2 )Q~ ax2 , (1.61) 

corresponding to the eigenvalue 

E 2 = 4 hco. (1.62) 

Of course, the eigenfunctions need not be obtained by guesswork, but 
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may be found by a straightforward process of calculation. This is somewhat 
lengthy, however, and since it is available in a large number of texts + 
we shall not reproduce it here. 

In general, the eigenvalues or energy levels are given by the formula 


E n = (n+\)hw, n= 0,1,2,- (1.63) 


The eigenfunctions given above are not normalized, out may be made so 
by multiplying each of them by an appropriate constant, so that equation 
(1.35) is satisfied. We obtain 

(3. .64) 

=2 (ir) * e ~** 2 , (1-65) 

2 = (1 — 4ax 2 )e -a *\ (1.66) 


These functions are shown in Figure 1.8. 

It should be noticed that in any state, according to our interpretation 
of \ij/ 2 \, there is a finite probability of 


finding the particle at large distances from 
the origin. This again is contrary to clas¬ 
sical mechanics — classically permissible 
ranges of motion of the particle in the 
various energy levels E n are shown by the 
horizontal line segments in Figure 1.7. 

Another point which is worth noticing 
is the following: according to classical 
mechanics the oscillator could have zero 
energy, in which case the particle would 
be at rest at the origin, but according to 
wave mechanics this is impossible — the 
lowest energy the oscillator can have is 



Fig. 1.8. Normalized wave func¬ 
tions for the three states of lowest 
energy of the harmonic oscillator. 
The maximum value of y> 0 is 
(2a/ ttH, and the extrema of \p x 
occur at ±\/(l/2a). 

which is called the zero- 


point energy of the oscillator. This fact has some importance in the theory 
of metals. For instance, the atomic nuclei in a metal vibrate like harmonic 


„.;,f ee - for lnstance > Paulmg and Wilson, Introduction to Quantum Mechanics (McGtaw- 
Hill), p. 67, for a very full treatment. 
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oscillators. At the absolute zero of temperature, when classical physics 
would have all the nuclei at rest, they must in fact still vibrate with their 
zero-point energies, and it is sometimes necessary to take this into account 
in calculating, for example, the cohesive energy of a metal. Another im¬ 
portant application, to the theory of plasma oscillations in metals, will 
be discussed in Chapter 10. 


CHAPTER 2 


THE BASIC PRINCIPLES OF WAVE MECHANICS 


2.1. Hamiltonian Mechanics and the Schrodinger Equation 

The classical Hamiltonian function of a conservative system consisting 
of d single particle moving in a line is simply the total energy of the particle 
expressed in terms of its momentum p and positional coordinate x. It 
is denoted by H(x,p). 

The total energy E is given by 

E = kinetic energy+potential energy 

= ±mv 2 +V(x), (2.1) 

where v = x, as before. Also, the momentum is p = mv, so that the classical 
energy equation in Hamiltonian form is 

H(x,p) — E, (2.2) 

H(x, p) = — + F(x). (2.3) 

2 m 

— - - and dH - dV 
dp m 8x dx 

However, pjm = v = x, and, according to Newton’s equations of motion 
[or, alternatively, by differentiating equation (2.2) with respect to time, 
remembering that E is constant], 



where 


We thus have 


dV 

-- mv 

dx 


= P, 


so that equations (2.4) may be written 


— = x and 
dp 


dH 

dx 


= P■ 


(2.5) 
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These are called Hamilton’s canonical equations of motion for the system, 
and the variables p and x which satisfy them are said to be canonically 
conjugate. 

Now, if we tentatively write 


fi d 
i dx 


in (2.3), we obtain the operator 


H 



tf_ d^_ 

2m dx 2 


+ V(x), 


( 2 . 6 ) 


(2.7) 


and equation (2.2) becomes an operator equation. Operating with both 
sides upon some function \f/(x ), we obtain 



h(x,|—W =£<A, 

(2.8) 


\ i dx/ 

or 

- f= **. 

2m dx 

(2.9) 


which may be re-arranged in the form of equation (1.7), and is just the 
Schrodinger equation of the system. 

It seems, therefore, that there is a close relationship between the classical 
energy equation in Hamiltonian form and the Schrodinger equation. The 
essential connection is the representation of the momentum by the dif¬ 
ferential operator (2.6). It is, in fact, a fundamental principle of quantum 
mechanics that every dynamical variable may be represented by an operator. 
For instance, in the operator (2.7), the coordinate x has really become 
the operator ‘multiply by x’, but since this behaves just like the number 
x itself it was unnecessary to state this explicitly. It is a fact which must 
be borne in mind, however, for these operators are not the only possible 
ones which may be used to represent p and x, and others are sometimes 
more convenient. 

The foregoing treatment of the linear motion of a single particle suggests 
the way in which we might generalize the Schrodinger equation to account 
for the three-dimensional motion of a particle or of many particles. We 
will write down the classical Hamiltonian of the system in terms of the 
Cartesian coordinates and momentum components of each particle, and 
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then transform it into an operator by substituting for each momentum 
component a differential operator like (2.6). The Schrodinger equation 
will then be 

Hp = Eip, (2.10) 

where H is the Hamiltonian operator and ij/ is a function of the Cartesian 
coordinates of all the particles in the system. Once more, it must not be 
imagined that this is a logically necessary process — it is a tentative process, 
which will be justified by its results. 

Let us take the case of a single particle in three dimensions first. The 
potential energy V(x, y, z) is now a function of x, y, and z, the rectangular 
Cartesian coordinates of the position of the particle, and the momentum 
is of magnitude p, given by 

2 2 , 2,2 

P = Px+Py+Pz, 

where p x ,p y , and p z are the Cartesian components of the momentum 
vector. The classical Hamiltonian, that is, the total energy of the system 
expressed in terms of the momentum and positional coordinates of the 
particle, is thus 

H = ~—(pl + Py+ Pz) + V(x, y, z). (2.11) 


As before, we may show that 


— = x and 

dp x 



( 2 . 12 ) 


with two similar pairs of equations in y and z. These are Hamilton’s 
canonical equations of motion for the system. The coordinate x and mo¬ 
mentum p x are said to be canonically conjugate, and so also are the pairs 
of variables (y,p y ) and (z,p z ). 

In order to transform (2.11) into the quantum-mechanical Hamiltonian 
operator we substitute for each momentum component a derivative with 
respect to its conjugate variable — as in (2.6), except that partial derivatives 
are now required, since several variables are present. In other words, we 
write 




h d 
i dz ’ . 


Pz 


(2.13) 
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and the Hamiltonian operator becomes 

H= -^,[B + ^ + B) +v(x ’ y - z) ' (ii4) 

Substituting this in equation (2.10), we obtain the Schrodinger equation 


h 2 (8 2 ij/ 8 2 il/ d 2 iI/\ T/ . , 

(0? + s? + a?) + (x ’^ ^ = (215) 


2m 


where i]/ is a function of a% y and z. 

The differential operator in (2.14) is generally called the Laplacian 
operator, and may be written V 2 , thus, 


v 72 d 2 d 2 d 2 
dx 2 dy 2 dz 2 ' 


(2.16) 


The Schrodinger equation may therefore be re-arranged in the form 


O Ml 

V 2 i/< + — [£- Vfx, y, z)]i ji = 0, 


(2.17) 


which is similar to equation (1.7). 

The extension to systems of several particles follows immediately. Let 
us suppose that there are n particles and that the z'th particle has mass m u 
rectangular Cartesian coordinates (x h y if z^), and momentum components 
(p X i, Pyi, p Z i)- Then the classical Hamiltonian is 

H = E -— + V(x u y u z u ... x„, y„, z „), (2.18) 

i=i 2m,- 


where pf = ph+ph+pli, since pf/2m,- is the kinetic energy of the zth 
particle and the sum is over all the particles. The potential energy V is a 
function of the positional coordinates of all the particles. 

In accordance with (2.13) we now make the substitutions 


h d _ h d _ h d 

Pxi = T T > Pyi = T . > Pzi T 3 > 

i dx t i oyi l 0Zi 


(2.19) 


for all i, obtaining the Hamiltonian operator 

" h 2 

H = - ^ — V? + V(x u y u z t ,... x n , y n , z„), 

i= i 2m,- 


( 2 . 20 ) 
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where 



( 2 . 21 ) 


The Schrodinger equation then becomes 


-I — VjV+F«A = Eils, 


i=i 2m* 


( 2 . 22 ) 


where ij/ is a function of all the x^y^Zi. 

2.2. The Wave Function and its Interpretation 

Let us first consider a single particle moving in three dimensions, whose 
Schrodinger equation is (2.17). By direct extension of the one-dimensional 
case, we assume that its wave function y, z) must be single-valued, 
continuous, smooth (except at infinities of V ), and finite everywhere. 
Smooth here implies that the partial derivatives of i}/ with respect to x, 
y and z are all continuous, or, more concisely, that the gradient of i/q 
written grad if/ or Vi p, is continuous (see Appendix 1). In addition, we 
require that the volume integral of i/Ai/i or \\j/\ 2 throughout all space be 
finite and not zero, that is 



number. (2-23) 


The Schrodinger equation is linear, so that any constant multiple of a 
solution is also a solution, and it therefore follows that we can choose 
\j/ so that the above integral has the value unity. In this case is said to be 
normalized. In the following work we shall find it convenient to write 
i P(r) instead of ij/(x , y, z ), where r is the position vector of the point 
(x,y,z), and also to write dr for the volume element d.vdydz. An integration 
throughout all space is then represented by a single integral sign. Thus, 
for a normalized wave function i/q we have 



(2.24) 


Again, as for the one-dimensional case, when i J/(r) is normalized, |i^(r)| 2 
is interpreted as a probability density, such that |^r(r)| 2 dr is the probability 
that the particle be found in the small volume element dr at r. 

In the case of a system of several particles, whose Schrodinger equation 
is (2.22), the conditions upon i j/, and its normalization, are almost exactly 
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the same as above — the only difference being that the integral of \ij/\ 2 
is no longer a triple integral, but a multiple integral with respect to the 
positional coordinates of all the particles. Thus, for the normalized wave 
function of a system of n particles, we have 



(2.25) 


each integral being a volume integral throughout all space. 

Similarly, if i p(r x , . . . r„) is normalized, \i[/(r l , . . . r n )\ 2 dr 1 . . . d r n is 
interpreted as the probability that particle 1 (that is, the particle with 
position vector rj be found in the small volume element di-j at r lt and 
simultaneously that particle 2 be found in dr 2 at r 2 , and so on. 

The 3«-dimensional space defined by the coordinates of the n particles 
is generally known as the configuration space of the system. As a further 
abbreviation we shall sometimes denote a volume element in configuration 
space by dr, that is, 


dt = dr l dr 2 .. . d r n . 


(2.26) 


An integration throughout the configuration space of a system is then 
denoted by a single integral sign — for example, (2.25) becomes, simply, 



(2.27) 


2.3. The Particle in a Box 

As a simple example of the motion of a particle in three dimensions 
let us consider a particle confined in a cubical potential box of side L. 
Although highly idealized, this model has an important application in the 
theory of metals. For simplicity we will take the potential energy of the 
particle to be zero inside the box and infinite outside, so that the problem 
is just the three-dimensional extension of that discussed in § 1.5. It fol¬ 
lows that the boundary condition upon the wave function \{/(x, y, z) is that 
it must fall to zero at the surface of the box. 

Let us take the origin at one corner and Cartesian axes along three 
edges of the box, as shown in Figure 2.1. The variation of the potential 
energy V(x, y, z) along the x-, y- and z-axes is then precisely the same 
as that in the problem of the one-dimensional box. 
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Inside the box, V = 0, and the Schrodinger equation (2.17) becomes 


dy ay ay 

dx 2 dy 2 dz 2 


2 m _, 

+ 77 = °- 

n 


(2.28) 


The procedure now is to look for solutions of the form 

^(x,y,z)=X(x)Y(y)Z(z), 


(2.29) 


where Z, F and Z are functions of one variable only. Such solutions are 
called separable solutions, and, when they 
exist, all possible solutions of the dif¬ 
ferential equation may be constructed 
by taking linear combinations of them. 

Separable solutions of the Schrodinger 
equation do not always exist, however — 
this depends entirely upon the form of 
the potential energy function — but we 
shall see that, even when separable solutions 
do not exist in one system of coordinates, 
it may be possible to obtain them by trans¬ 
forming to a different coordinate system. 

No difficulty is experienced in our present example. We substitute (2.29) 
in (2.28), and obtain 



Fig. 2.1. 


YZ 


d 2 X 


d 2 Y 


d 2 Z 2m 


, +XZ — 7 +XY —- + EXYZ = 0, 
dx 2 dy 2 dz 2 h 2 


or, dividing through by XYZ, 

1 d 2 X | 1 d 2 F 1 d 2 Z 2mE 
X dx 2 Y dy 2 Zd? + 


(2.30) 


Now, the first term in this equation is a function of x only, the second a 
function of y only, and the third a function of z only, while the sum of 
the three is a constant. This can only be so, in fact, if each term is equhl 
to a constant. We must therefore have 


1 d 2 X 2m „ 

- T = - T Ex 

X dx 2 lx 2 

1 — 7 = _ 2m 
Ydy 2 ~ ¥ y 


(2.31) 

(2.32) 










Ch. 2, § 3] 


THE PARTICLE IN A BOX 


31 


1 d 2 Z _ _ 2m 
Z dz 2 ~ h 2 


(2.33) 


where E x , E y and E z are constants, such that 

E x + E y + E z = E. (2.341 

The partial differential equation has thus been separated into three ordinary 
equations. 

Equation (2.31) is just 


d^r 

dx 2 


+ 


2 mE x 


X = 0, 


(2.35) 


which, apart from the slightly different notation, is the equation for the 
one-dimensional case, discussed in § 1.5. Also, the same boundary con¬ 
ditions apply, namely, Z(0) = X{L) = 0, so we may write down the solution 
immediately. The eigenvalues, or permitted discrete values of E x , are 
given by 

£* = ^n 2 , n x = 1,2,3,..., (2.36) 


and the corresponding normalized eigenfunctions are given by 



(2.37) 


The solutions for Y and Z are of exactly the same form, so that the wave 
functions of the stationary states are 




n x n y n z 



. n y ny . 
sin —— sin 


n z nz 

, 

L 


(2.38) 


nd the energy levels, according to (2.34), 

h 2 n 2 , , 


are 


- l n x tt y n z 


(nl + n 2 + n 2 ). 


(2.39) 


where n x> n y , n z are integers. The wave functions are, of course, zero outside 
the box. It should be noted that changing the sign of n x , n y or n z does 
not change the energy, nor does it result in a different wave function, 
but merely multiplies (2.38) by -1. All the stationary states are therefore 
given by the positive integral values of n x , n y , n z . Also, none of these may 
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be zero, for this would make the wave function identically zero, which is 
not permitted. 

It may easily be verified that the wave function (2.38) is normalized: 
equation (2.24) becomes 



Three numbers, namely, the integers n x , n y and n z , which are called 
quantum numbers, are required to specify completely each stationary state. 
It should be noted, however, that the energy E depends only upon the sum 
of the squares of n x , n y and n z . Consequently, there will in general be several 
different stationary states, or different wave functions, having the same 
energy. For example, the four stationary states whose values of ( n x , n y , n z ) 
are (5, 1, 1), (1, 5, 1), (1, 1, 5) and (3, 3, 3), respectively, all have the energy 
21h 2 n 2 /2mL 2 . Such states and energy levels are said to be degenerate. 
On the other hand, if there is only one wave function corresponding to a 
certain energy, the state and the energy level are said to be non-degenerate. 
As an example of the latter we may take the ground state, that is, the 
state with quantum numbers (1, 1, 1) — the energy of this state is 
3h 2 n 2 /2mL 2 , and no other state has this energy. 

It may easily be seen that any linear combination of several degenerate 
wave functions is still a wave function belonging to the same energy. Thus, 
if ij/ 1 and i j/ 2 are wave functions of states with the same energy E, then 

ETipi = Erp!, (2.41) 

and 

HiJ/ 2 = Eil/ 2 . (2.42) 

Multiplying (2.41) and (2.42) by the arbitrary constants A and B, respectively, 
and adding, we obtain 

H(Ail/ 1 + B\l/ 2 ) = E(Aip 1 + Bip 2 ), (2.43) 

showing that the arbitrary linear combination A\]/ 1 + B\j/ 2 is a wave function 
corresponding to the energy E. The number of independent wave functions 
(that is, wave functions which are not merely linear combinations of the 
others) belonging to an energy level is called the degree of degeneracy of 
the level. 
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2.4. Particle in a Central Field 

As a second important example we will consider a particle moving in a 
central field of force, that is, under the action of a force which is directed 
towards a fixed point and whose magnitude is a function only of the distance 
r of the particle from the point. The potential energy of the particle is then 
also a function of r only, and may be written V(r): it is said to be spherically 
symmetrical. 

The Schrodinger equation is 

VV+ [£- F(r)# = 0. (2.44) 

If we take the centre of force to be at the origin, then 

r 2 = x 2 +y 2 +z 2 , 

so that we may write the Schrodinger equation in 
coordinates of the particle, as in equation (2.15). 

This is generally inconvenient, however, because 
equation (2.44) can rarely be separated in Cartesian 
coordinates. In fact, separable solutions can only 
be found in Cartesian coordinates when V(r) is 
constant (free particle) or when V(r) is proportional 
to r 2 (three-dimensional harmonic oscillator). 

In other cases, the spherical symmetry of the 
potential function suggests that we transform to 
the spherical polar coordinates (r, 9, </>). These 
are shown in Figure 2.2: r is the length of the 
radius vector, 9 is the angle between the radius 
vector and O z, and (f> is the angle between the projection of the radius 
vector on the xy-plane and Ox. It is easily verified that 

x = r sin 6 cos (f>, 

y = r sin 9 sin (f>, (2.46) 

z — r cos 9. 

The transformation of the Laplacian operator (2.16) into spherical 
polar coordinates is rather lengthy, and we will merely quote the result. 
The Schrodinger equation (2.44) becomes 


(2.45) 

terms of the Cartesian 



Fig. 2.2. Spherical polar 
coordinates. 
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11 
r 2 dr 



+ 


* 2 sin 6 dd 


(sin e 
\ de , 


1 d 2 ib 2 m rrn s, . 

+ -7-5--r H-7 [£- F(r)]^ 

r 2 sin 2 0 d(f ) 2 h 2 


= 0, 


(2.47) 


where \p is now a function of r, 0 and <f>. Separable solutions can now be 
found, of the form 

Mr, 0,4) = R(r)0(9)<P(4>), (2.48) 


R, 0 and <f> being functions of a single variable only. 

Substituting (2.48) into (2.47), multiplying through by r 2 sin 2 9/R0<P, 
and re-arranging, we obtain 


sin 2 9 d / 2 dR\ 
R dr V dr) 


+ 


sin 6 d 
0 dO 



+ 7 T [ £ “ V(r)]r 2 sin 2 9 = - . (2.49) 

n 0 d <p 

The right-hand side is a function of cf> only, while the left-hand side 
is independent of (f >. Both sides must therefore be equated to the same 
constant, which we denote by a: thus, 


d 2 0 
d <f> 2 


— <x<P, 


sin 2 9 d I 2 d R\ 
R dr\ dr) 


sin 9 d 


+- 

0 d 9 



(2.50) 


+ —r [E—V(rJ]r 2 sin 2 9 = oc. (2.51) 

h 


Dividing equation (2.51) through by sin 2 9, and re-arranging, gives 


1 d 
Rdr 



+ [£- V(r)y 


a 

sin 2 9 


_ 1 _ d^ 

0 sin 9 d 9 



(2.52) 


The left-hand side of this equation is a function of r only, while the right- 
hand side is a function of 9 only. Hence, both sides must be equated to 
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the same constant, which we denote by /?. After slight re-arrangement, 
the separated equations become 


1 d 


sin 9 


sin 9 d0 


d 2 R 2d R 

- — +-h 

d r r dr 


del \ 


dd 

2m 

U 7 


P 


sin 2 0. 


0 = 0 , 


{£-K(r)}-4 


r 2 j 


R = 0. 


(2.53) 

(2.54) 


It should be noted that, of the three separated equations (2.50), (2.53) 
and (2.54), only equation (2.54) contains the potential function V(r). It 
follows that the radial function R(r ) is the only one which depends upon 
the form of the central field; the angular functions 0(9) and <P(</>) are the 
same for all central fields. 

Two separation constants, a and /?, have been introduced, about which 
nothing has been said so far. If ^ is to satisfy the usual conditions, however, 
we might expect that these constants will not be able to assume arbitrary 
values. This is most easily seen in the case of a. Two independent solutions 
of equation (2.50) are 

<P = e iV5 *, (2.55) 

and 

<P = e _i ^. (2.56) 

An alternative pair of independent solutions (linear combinations of the 
previous pair), which is sometimes more convenient, is 

<Z> = sin \/a</>, (2.57) 

and 

<P = cos (2.58) 

Now, the point (r, 9, 4>) is the same as the point (r, 9, 0 + 2tc). Hence, if the 
wave function is to be single-valued, as required, 0 must not change in 
value when </> is increased by 2n, that is, 

<*>(</>+ 20 = <*>(</>), (2.59) 

or $ must have period 2n. In order that this be so, we must have 

vet = m t , (2.60) 

where m, is a positive or negative integer or zero. We may therefore take 
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the (un-normalized) eigenfunctions of equation (2.50) to be 

= eim, *> ( 2 - 61 ) 

where w ( = 0, ±1, ±2,..., and it must be remembered that the positive 
and negative values correspond to different eigenfunctions. Alternatively, 
we may take them to be 

= sin when m l = 1, 2, 3,. .., j 

$ m ,(0) = cos m^, when m, = 0, -1, -2,. . .. ) 

We do not propose to solve equation (2.53) generally, as this is rather 
tedious. It is found that, for 0 to be finite everywhere, we must have 

£ = /(/+1), (2.63) 

where / = 0, 1, 2,..., and, in addition, 

N| /, (2.64) 

The acceptable solutions for 0 are associated Legendre functions of cos 9, 
generally denoted by P ? 1 (cos 9). 

Equations (2.53) and (2.54) have now become 


1 d 
sin 9 d0 




+ 


1 )— 


d 2 R 2 d R 

-r H-h 

dr r dr 



R = 0 


(2.65) 

( 2 . 66 ) 


The latter equation cannot be solved in general, since its solution depends 
upon the form of F(r), and indeed there are only a few forms of V(r) for 
which analytical solutions can be obtained. However, we expect to obtain 
eigenfunctions corresponding to a series of discrete values of E, which we 
may label by means of some quantum number n, say. Since the equation 
contains /, it is clear that the eigenfunctions, and generally the eigenvalues, 
will also depend upon /. 

Summarizing, then, a wave function corresponding to the energy level 
E„, is specified by three quantum numbers n , /, m h and may be written 

d ’ <t>) = Kt{r)P?{oos 0)e im '*. (2.67) 

The states must usually be degenerate, for the energy does not depend 
upon the quantum number m t — m t does not occur in equation (2.66), 
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which determines the energy. Equation (2.64) tells us, in fact, that there 
are at least 2/+1 states to each energy level, since m, may assume the 
21 +1 values 


-/, — (/— 1 ),... - 1 , 0 , 1 ,. . . (/— 1 ), /. ( 2 . 68 ) 

We shall see below that in the special case of the hydrogenic atom, the 
degeneracy is greater than this. The significance of the quantum number 
n, called the principal quantum number, will be brought out more clearly 
later. / is called the azimuthal quantum number, since 6 is the azimuthal 
angle, m, is called the magnetic quantum number, because it is most im¬ 
portant in problems involving atoms in magnetic fields. 

States for which / = 0 are called s states, those for which 1=1 are 
called p states. These are the states which will concern us most later in the 
book, so it will be as well to consider them in detail, 
s states. When / = 0, equation (2.64) tells us that m, = 0 also, so that 
the function ^ is a constant. Equations (2.65) and (2.66) then become 



d 2 R 

dr 7 


2 d R 2m r ^ T ,. ^ 

+ —[£-F(r)]R = °. 

r dr n 


(2.70) 


An obvious solution of equation (2.69) is 


0 = constant, 


(2.71) 


and this is the only solution which is finite everywhere. It follows that the 
wave function of an s state is spherically symmetrical — it is, in fact, just 
an eigenfunction of equation (2.70). Conversely, if we substitute i j/ = R(r) 
in the Schrodinger equation (2.47), it reduces immediately to equation 
(2.70), and comparison with equation (2.66) then tells us that / = 0. Hence, 
all spherically symmetrical states are s states. 

p states. When / = 1, equation (2.64) permits m, to have the values 
— 1,0, 1, so that there are three degenerate wave functions. We say that 
p states are triply or threefold degenerate. With m x = +1, equation (2.65) 
becomes 


1 d^ 
sin d d 6 




0 = 0 , 


(2.72) 
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which has the finite solution 

0 = sin 9, (2.73) 

as may be verified by substitution. With m t = 0, the equation becomes 


1 


sin 9 d 9 

which has the finite solution 


sin 9 —) +20 = 0, 
d 9/ 


(2.74) 


0 = cos 9. 


(2.75) 


If we take the functions <P given by (2.61), therefore, the three degenerate 
p states corresponding to a given principal quantum number n, or energy 
level E n< i, are 

•A*,i,-i = -R„ ( i(r)sin 0e _i * ) 

•An, i,o = K i(r)cos0, (2.76) 

•An, 1,1 = R n, iMssinfle 1 '*, j 


where i? n>1 (r) is the eigenfunction of the equation 


d 2 R 2 d R 

-r 4-h 

dr r dr 


2m 

J? 


{£ - no) - 



R = 0 , 


(2.77) 


with E = E n l . 

Alternatively, we could take the functions <P given by (2.62), and obtain, 
instead of (2.76), 

•A*, i,-i = i?„,i(r)sin 0cos </>, ) 

' i ' n , 1,0 = R„,t(r)cos9, (2.78) 

*A n ,i,i = sin 9 sin <f>. J 

Recalling equations (2.46), we obtain, in more compact form 


•A*, i,-i = xf(r), \ 

i,o = z/(r), (2.79) 

'I'n, i, i = >’/(/•), J 

where 

ZOO = ^,,i//-- (2.80) 


Since the Schrodinger equation (2.44) is symmetrical in x, y and z, we 
should, in fact, expect that, if any one of the above functions is a solution. 
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then the other two must be solutions corresponding to the same energy. 

It should be noticed that the function xf(r ) is zero over the plane x = 0, 
and the other two functions are zero over the planes z = 0, y = 0 res¬ 
pectively. Such planes are called nodal planes. Thus, a p state, with 1=1, 
has one nodal plane passing through the origin. An s state, on the other 
hand, with / = 0, is spherically symmetrical, and has no nodal planes. In 
general, a state with azimuthal quantum number / has / nodal planes 
passing through the origin. Of course, the radial factor in the wave function 
will also generally give rise to nodal surfaces, that is, surfaces on which 
the wave function is zero, but these will be nodal spheres , centred at the 
origin. The principal quantum number n is defined so that n— 1 is the 
total number of nodal surfaces , both plane and spherical. Since there are / 
nodal planes, it follows that 

l^n-L (2.81) 

States with higher values of / are also denoted by letters; thus, 

1 = 0 1 2 3 4. 

s p d f g 

We shall not discuss f and g states, but it might be worth while to give a 
brief mention to d states, as they are of some importance in the theory 
of metals. 

d states. When / = 2, equation (2.64) permits m, to have the values —2, 
— 1,0, 1, 2, so that d states are fivefold degenerate. The five independent 
wave functions may be taken to have the forms 

yzF(r), zxF{r), xyF(r), 

(y 1 — z 2 ) F(r), (z 2 — x 2 ) F(r), 

F(r) being an appropriate radial function. It will be noted that each of 
them has two nodal planes (for example, the first is zero on the planes 
y = 0, z = 0) passing through the origin. 

Since the energy is not affected by the value of m, (in the absence of a 
magnetic field) it is generally sufficient to refer to a state by its values of n 
and /, and this is done by writing the value of n followed by the letter 
denoting the value of/. Thus, the ground state, with n = 1, / = 0, is called 
a Is state; a state with n = 3, / = 1, is a 3p state; a state with n = 3, / = 2, 
is a 3d state, and so on. 


40 


BASIC PRINCIPLES OF WAVE MECHANICS 


[Ch. 2, § 5 


2.5. The Hydrogenic Atom 

The foregoing discussion applies quite generally to all central fields. 
We will now consider a special case of particular importance—the hydrogenic 
or hydrogen-like atom. This is an atom which consists of a nucleus and a 
single electron, typical examples being the hydrogen atom itself, the singly- 
ionized helium atom, the doubly-ionized lithium atom, and so on. Ionization 
means the removal of an electron or electrons — a neutral atom of helium 
has two electrons, and when one of these is removed we have a singly- 
ionized helium atom. Similarly, a neutral atom of lithium has three electrons, 
and when two of these are removed we have a doubly-ionized lithium 
atom. Hydrogenic atoms of various elements differ from each other only 
in the charge and mass of the nucleus. We will denote the charge of a 
proton by e, so that the charge of an electron is — e. Then, if the atomic 
number (the number of electrons in a neutral atom) of an element is Z, 
the nuclear charge is Ze, so that the attractive force between the nucleus 
and an electron is of magnitude Ze 2 //- 2 , where r is the distance between 
the two. The potential energy of the electron in the field of the nucleus is, 
therefore, 

V(r) = - — , (2.83) 

r 

taking the zero of potential energy to be at infinite separation. 

Now, the general motion of a pair of attractive particles can be represented 
as an orbital motion of the particles about their common centre of mass 
plus a uniform linear motion of this centre of mass. However, the mass 
of a proton is 1836 times that of an electron, so that, even in a hydrogen 
atom, whose nucleus consists of a single proton, the centre of mass lies very 
close to the nucleus, and in all other atoms, whose nuclei consist of a number 
of protons and neutrons (the mass of a neutron is the same as that of a 
proton, but a neutron has no charge) it lies much closer still. It is clear, 
therefore, that little accuracy will be lost if the orbital motion of the nucleus 
is ignored and the electron is regarded as revolving about the nucleus as 
centre of force. Also, in the case of a free atom, the linear motion of the 
centre of mass is not quantized, that is, it does not give rise to a discrete 
series of energy levels, and when the atom is bound, as in a solid, so that 
the motion of the centre of mass is quantized, this motion can be treated 
quite separately from that of the electrons. Hence, for many purposes, the 
motion of the centre of mass, which we have assumed to be approximately 
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coincident with the nucleus, may be ignored, and we may take the problem 
of the hydrogeme atom to be that of an electron moving in the central 

e d of a fixed nucleus, with potential energy given by (2.83). All this can 
be shown quite rigorously, of course, and indeed the difficulty regarding 
the orbital motion of the nucleus can be completely overcome by replacing 
the electronic mass m by a reduced mass mM/(m + M), where M is the mass 
of the nucleus. We will not go into this question further, however, and 

as the reduced mass for hydrogen is 0.9995 m we will ignore the difference 
between this and m. 

The radial equation (2.66) can be solved analytically when V(r) is given 
by (2.83), but we shall not require the general form of the eigenfunctions, 
and so will again content ourselves with one or two special cases. 

a) n — l, o • This Is state has no nodal surfaces, either plane or spherical, 

and, as we might expect, it is the ground state of the atom. The un-normal- 
ized wave function is 


<Ai,o.o(r) = R lf0 (r) = e" 2 ^, 

where a 0 = h 2 /me 2 , as we will now verify. 

R must satisfy the equation 


Now 


and 


The terms in r cancel, and we are left with 


F Z2yne4 r 


(2.84) 


d 2 R 

2 dR 

2m / 

Ze 2 \ 


dr 2 

+-+ 

r dr 

—r- (£ + 

h 2 \ 

-)* = a 

(2.85) 

dR _ 
dr 

■ — — e ~ z 't a o _ _ 

Zmfi2 p -Z r/ao 

h 2 

(2.86) 


d 2 R i 

'Zme 2 \ 2 

-Zr/ao 



dr 2 ~ \ 


(2.87) 

.85), and dividing through by R, gives 


Zme 2 \ 

1 2 2 Zme 

2 2 ml 

' Ze 2 \ 


»* J 

h 2 r 

- + f( 

o 

II 

il " 

+ 

Kl 

(2.88) 


(2.89) 
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and (2.84) is not a solution unless E has this value, which is thus the lowest 
energy level of the atom. 

b) n = 2, / = o. This 2s state has a single nodal sphere, but no nodal planes, 
and its energy level is the next above the ground state level — it is the first 
excited state. The un-normalized wave function is 


<h,o.o( r ) = Ri,o(r) - (2» 0 -Zr)e- z " 2 “». (2.90) 


The equation which R must satisfy is again equation (2.85), but, following 
the above procedure, this time we find that it will only do so if 


E 


Zhne 4 J_ 
2h r 2 1 


J 2 t 0 * 


(2.91) 


c) n = 2, / = l. There are three degenerate 2p states, all with the same 
radial function 


R 2 ,i(r) = re~ Zr/2a °, 


(2.92) 


and angular functions given by (2.76) or (2.78). The only zero of R is at 
the origin, so that a 2p wave function has no nodal spheres, but one nodal 
plane. 

In this case R must satisfy the equation 


d 2 _R 
d? 


+ 


2d R [2m ( 

-+ -r\E+ 

r dr L n \ 


Ze- 


r 2 J 


R = 0, 


and it may be verified as before that it will only do so if 


E = 


Zme 4 1 


2 h 2 


1 = E 2.1- 


(2.93) 


(2.94) 


It will be noticed that £ 2>1 is the same as £ 2 .o> so that the energy is 
independent of /, and the 2s and 2p states are degenerate. This is always 
true for a hydrogenic atom — the energy levels depend only upon the prin¬ 
cipal quantum number n, and, as suggested by the above expressions, are 
given by 


Z 2 ?ne 4 1 
2 h 2 n 2 ' 


(2.95) 


The states of the hydrogenic atom thus have a greater degree of degeneracy 
than do those of an electron in other central fields, for which the energy 
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levels vary with / as well as with n. We have seen in § 2.4 that, for every 
central field, owing to the degeneracy of the <P functions, there are 21+ 1 
degenerate states with the same values of n and /. In the hydrogenic atom, 
however, all states with a given value of n and with / having the values 
0, 1, 2,. . . (n— 1) are degenerate also. Since 

Z o (2f+l ) = n 2 , (2.96) 

the total degeneracy of a state with principal quantum number n is n 2 . 

Formula (2.95) gives results which can immediately be compared with 
experiment. For instance, if a hydrogen atom (Z = 1) makes a transition 
from an excited state with principal quantum number n to the ground state, 
the Bohr frequency rule [equation (1.6)] tells us that radiation of frequency 
v will be emitted, where 


hv = E n -E 1 



(2.97) 


Lines corresponding to these frequencies, for different values of n, as well 
as other series of lines due to transitions to final states other than the 
ground state, have been observed in the spectrum of hydrogen, in excellent 
agreement with the theoretical results. 


2.6. Electronic Charge Clouds 

The radial functions R n>l for states of low energy of the hydrogen atom 
are shown in Figure 2.3. Excluding the origin, each function has n—l— 1 
zeros. 

The usual probability interpretation may be given to \\p\ 2 , provided 
<A is normalized, that is, provided we include in ^ a constant factor s'uch 
that 


I 


\ij/\ 2 dr 




\ij/\ 2 r 2 sin 6 d</> 


'f'lfi 

-J o 


r z \R\ 2 dr\ |0| z sin0dd H # = 1. 


(2.98) 


There is, however, another interpretation, or perhaps another way of looking 
at the probability interpretation, which is often convenient. This is the 
charge cloud interpretation. Since the electronic charge is —e, then, 
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according to the interpretation of |»A| 2 as a probability density, the average 
charge density at any point must be -e\\p\ 2 . In many problems it is useful, 
if only as an intuitive aid, to picture the electronic charge as spread out 
in a cloud having charge density — e\\j/(r)\ 2 at position r. 



Fig. 2.3. Radial wave functions for states of low energy 
of the hydrogen atom. 

The charge clouds of s states are all spherically symmetrical. Those of p 
states, however, have a dumb-bell like appearance. Of course, all the charge 
clouds extend to infinity, but in the case of p states the principal con¬ 
centration of charge is in two lobes symmetrically placed on either side of 
the nodal plane. This may be shown as follows: from equations (2.79) 
and (2.80) we find that a typical p function is 

t = -R, 

r 


(2,99) 
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where R is the radial function, and we will assume that 0 is normalized. 
The charge density is thus 


-<# I 2 = -e 



( 2 . 100 ) 


and this is zero on the plane x = 0, is symmetrical with respect to'this 
plane, as well as about the x-axis, and has 
its greatest values on the x-axis. The 
angular factor (x/r) 2 may be represented 
by means of a polar diagram, as shown in 
Figure 2.4 — the length of the radius vector 
from the origin to the curve gives the value 
of the function in the direction of the 



Fig. 2.4. Polar diagram showing 
the angular variation of charge 
density for a p state. 


radius vector, and the same curve applies to all planes through the x-axis. 
The charge clouds of the other two p states have their lobes in the y- and 
z-directions respectively. 


Similarly, the charge clouds of states with higher values of /, d states, 
etc., have a greater number of lobes, corresponding to the greater number 
of nodal planes. 


2.7. Orthogonality of Wave Functions 


2.7.1. NON-DEGENERATE STATES 


If i an d 02 a re wave functions of a particle moving in three dimensions, 
corresponding to the different energy levels E l and E 2 respectively, then 

jV?0 2 dr = 0, (2.101) 


the integration being throughout all space, and the functions are said 
to be orthogonal. 

This may be proved in the same way as for the one-dimensional case. 
From equation (2.17) and its complex conjugate, assuming that the potential 
function V(r) is real, we have 
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Multiplying the first equation by i J/ 2 , subtracting from it the second mul¬ 
tiplied by ip*, and integrating throughout all space, gives 

JWv?-**VV 2 ) dr = ^ (£,-£,) (2.104) 

According to Green’s theorem (see Appendix 1) a volume integral such 
as that on the left-hand side may be transformed into a surface integral 
over the bounding surface, thus 

J(^VV?-^iVV 2 )dr = J(V2^* dS > ( 2 ‘ 105 ) 

where 8/dn denotes differentiation in the direction of the outward normal 
to the surface. In the present case the volume is the whole of space, so 
that the surface is infinitely large. Since the wave functions of bound states 
must vanish at infinity, in order to satisfy equation (2.23), the integrand 
of the surface integral must be zero, and consequently the integral is zero. 
The left-hand side of equation (2.104) is therefore zero, and, since E 2 ¥ z E l , 
equation (2.101) follows. 

To be quite accurate, we must point out that the vanishing of the wave 
functions at infinity is not in itself sufficient to ensure that the surface 
integral in (2.105) is zero. The use of the term ‘infinity’ always implies a 
limiting process — in this case we are integrating over a large closed surface 
and finding the limiting value of this integral as the surface swells indefinitely 
in all directions. Since the surface area itself is increasing, it is clear that 
the limit will be zero only if the wave functions tend to zero sufficiently 
rapidly. For example, if we take the surface to be a sphere centred at the 
origin, with radius r and surface area Anr 2 , then, in order that the surface 
integral vanish in the limit as r -> oo, the integrand must tend to zero 
more rapidly than l/r 2 . However, this need not worry us, because it can be 
shown that all bound state wave functions tend to zero with sufficient 
rapidity. 

Of course, it may not be necessary to take the integral (2.101) throughout 
all space. In the problem of the particle in a box, discussed in § 2.3, we have 
seen that the wave functions vanish at the surface of the box. The volume 
integrals may therefore be taken throughout the inside of the box only; 
the surface integral becomes an integral over the finite surface of the box, 
and is zero because the wave functions vanish there. The above difficulties 
then do not arise. When we come to discuss metals we shall reconsider the 
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problem of the particle in a box and impose boundary conditions different 
from those mentioned above. These new boundary conditions, however, 
which are called periodic boundary conditions , are still of such a nature that 
the surface integral in (2.105) vanishes over the surface of the box, so that 
the wave functions remain orthogonal. 

2.7.2. DEGENERATE STATES 

If and ^2 are different wave functions belonging to the same degenerate 
energy level, the foregoing proof, which demands that E 2 ^ E u breaks 
down, and the functions need not be orthogonal. On the other hand, the 
functions may still be orthogonal, and, if they are not, then linear com¬ 
binations of them, which are orthogonal, can certainly be constructed. 
This may be done in the following way. 

Let us assume that t/q is normalized, so that 



f|(Ai| 2 dr = 1, 

(2.106) 

and also that 


J #ty 2 dr = c, 

(2.107) 

where c is some 

constant. Now let us write 



^3 = c'I'i-'I'i, 

(2.108) 

so that 

J 

•A* iMr = cj J \l/*\j/ 2 dr = 0, 

(2.109) 


from (2.106) and (2.107). Thus, \J/ 1 and i(/ 3 are independent degenerate 
wave functions which are orthogonal (we proved at the end of § 2.3 that 
any linear combination of degenerate wave functions is a wave function 
belonging to the same energy level). This process may be extended to 
any number of degenerate wave functions, so that a number, equal to the 
degree of degeneracy, of independent orthogonal wave functions may 
always be found. 

2.8. The Representation of Dynamical Variables by Operators 

In this section we will introduce a further abbreviated notation, which 
will also be used in Chapter 11. We will let q stand for any one of the Car¬ 
tesian coordinates of the particles of a system, and p stand for the conjugate 
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momentum, and simply write f(q, p) for a function of all the coordinates 
and conjugate momenta of the system. 

We have seen in § 2.1 that the replacement of p by the operator 
(h/i)(d/8q) in the classical Hamiltonian H(q, p) leads to the Schrodinger 
equation 



( 2 . 110 ) 


We have remarked that q has really become the operator ‘multiply by q\ 
although this does not greatly affect the problem. For any stationary state, 
or eigenfunction ij/, the energy E is constant; it is said to be a constant 
of the motion. 

This formal derivation of the Schrodinger equation is not the only use 
to which the operational form of p may be put. Any dynamical variable 
which classically may be written as a function f(q, p) of the Cartesian 
coordinates and conjugate momenta of the particles of a system may be 
transformed into a quantum-mechanical operator in the same way as the 
Hamiltonian, and some information about the values of the dynamical 
variable may so be derived. 

For example, it sometimes occurs that the eigenfunction \j/ of equation 
(2.110) also satisfies the equation 



( 2 . 111 ) 


where X is a constant. In this case it is said that / and H have simultaneous 
eigenfunctions', the obvious interpretation being that the dynamical variable 
represented by f is also a constant of the motion, and that its value is X in 
the state i/l We will not prove this, but will accept it as a fundamental 
postulate of quantum mechanics. 

It should be clear that it is by no means necessary for equation (2.111) 
to be satisfied by an eigenfunction of the Schrodinger equation. In general, 
an operator J will act upon \J/ to give a function quite different from t/q 
and not just a constant multiplied by i/l 

As a simple example, let us consider a particle moving freely along the 
x-axis. The classical equation of energy in Hamiltonian form, taking the 
potential energy to be zero, is 


H ( x , P x ) = ~p 2 x = E. 


(2.112) 
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A solution of this is 


corresponding to the energy 


h d 

Px T ^ 9 

i dx 

(2.113) 

obtain the Schrodinger equation 


h 2 d 2 \j/ 

--V = Eli/. 

2m dx 2 

(2.114) 

II 

ru_ 

£ 

H 

(2.115) 

E = 

2m 

(2.116) 


(we may ignore the normalization difficulty which occurs for free particles, 
mentioned in a footnote in § 1.6.). 

Now, according to classical mechanics, the momentum p x is certainly 
a constant of the motion, as is the energy E, so let us see if we can find an 
equation like (2.111) with /equal to the momentum operator (2.113) and ij/ 
the function (2.115). We have, in fact, 


~ — e ikx = hkt ikx , 
i dx 


(2.117) 


which has the form of (2.111), with A = hk. H and p x thus have simul¬ 
taneous eigenfunctions, and we deduce that, quantum-mechanically also, 
p x is a constant of the motion, with value hk. This is not surprising, for 
the classical value of p x , obtained from (2.112) and (2.116), is also hk. 
It is easily seen, however, that 

xe ikx * Ae ikx , (2.118) 

for any constant A. Hence H and * do not have simultaneous eigenfunctions, 
and x is not a constant of the motion, again in agreement with classical 
mechanics. 

2.8.1. COMMUTATION RELATIONS 

By straightforward differentiation, we have 

d . . # 

— W = <A + x — , 
ox ox 


(2.119) 
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so that 


h d , n 
t — (x\p)-x 
1 Ox 


h dip 
i dx 



Using equation (2.113), this may be written 

p x x\p-xp x ip = ^\p, 
i 


( 2 . 120 ) 


( 2 . 121 ) 


where it must be remembered that p x is the operator representing the 
momentum. From (2.121) we obtain the operator equation 

h 

p x x-xp x = ~. (2.122) 

l 


The left-hand side of this equation is called the commutator of p x and 
*, and is generally denoted by [p x , x], so that we have the commutation 
relation 

0 *,*] = -. (2.123) 

l 

In general, for any two operators A and B, the commutator is defined by 


[A, B ] = AB-BA. 
On the other hand, we see that 


d , n dip 

— W) = y T-, 

ox ox 


(2.124) 


(2.125) 


since and y are independent variables, and consequently 


h d 
i dx 


h dip 

w-nrr 0> 


(2.126) 


or 


p x y-yp x = [p x ,y] = o. (2.127) 

Two variables, such as these, whose commutator is zero, are said to com¬ 
mute. We have shown that canonically conjugate variables, such as .v and 
p x , do not commute. In quantum mechanics the commutation relation 
(2.123) is, in fact, taken to be the definition of canonically conjugate coor¬ 
dinates and momenta, rather than Hamilton’s equations (2.12). 
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Equation (2.122) represents the fundamental difference between quantum 
mechanics and classical mechanics. According to the latter, the right-hand 
side would be zero. This equation may be taken, instead of the Schrodinger 
equation, as the basis of quantum mechanics. It was used, in fact, before 
the discovery of wave mechanics, and it is still the fundamental equation 
of the other branch of quantum mechanics, namely, matrix mechanics — 
in this case, however, p x is not a differential operator, but both p x and x 
are matrices. 

We may also obtain the condition that an operator / will represent a 
constant of the motion, that is, have simultaneous eigenfunctions with H, 
in the form of a commutation relation. The equations 

Hl P = E\p, (2.128) 

M (2.129) 

must be satisfied by the same \j/. Operating on both sides of equation 
(2.128) with /, we have 

/Hip =/E\p = Efp = EX xp. (2.130) 

Similarly, operating on both sides of equation (2.129) with H, we have 

Hf\p = HX\p = XH\p = XEip. (2.131) 

Hence, 

m = Hfip, 

or 

(fH-Hf)iP = 0, (2.132) 

from which we obtain the commutation relation 

[f,H]= 0. (2.133) 

In other words, any operator which represents a constant of the motion 
commutes with H. The converse is also true. 

2.8.2. ANGULAR MOMENTUM 

As a useful example of the representation of dynamical variables by 
operators, let us consider the angular momentum of a particle in a central 
field of force. According to classical mechanics, the angular momentum, 
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or moment of momentum, L about the origin is defined by the vector 
equation 

L = r a p, (2.134) 

where r is the position vector of the particle and p its linear momentum. 
In some system of rectangular Cartesian coordinates (see Appendix 1) 
L therefore has the components 


L x =yp z ~zp y , \ 

L y =zp x -xp z , > (2.135) 

L z = xp y -yp x . ) 

We have seen [equation (2.67)] that the eigenfunctions of H, that is, 
the solutions of the Schrodinger equation, may be taken to have the form 

= /?„»??'(cos (2.136) 


We will now show that these are also eigenfunctions of L z . 
Writing 

h 8 
i dy 

in the usual way, we obtain the operator 


h d 

Px = - 

i dx 


Py=~ — 


j _h( d d\ 
z i\dy y dx) ' 


(2.137) 


We must transform this to spherical polar coordinates. We have 


d _ dx d dy d dz 8 

8<j) 8(f) dx d(p dy d<p dz 


= — r sin 6 sin 6 -b r sin 6 cos (b — 

dx dy 

d d 

= -y —b x — , 
dx dy 

from equations (2.46). Hence, 


(2.138) 
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Operating with this upon the \J/ in (2.136), we obtain 

h dip 

~idi = ,,m * (2 ' 140) 

since <p only occurs in the factor exp (im,<£). This equation is of the same 
form as (2.111), with X equal to hm t . Thus, the z-component of the angular 
momentum is a constant of the motion, and has the value hm t , in the 
given state \p. Corresponding to any value of /, therefore, there are (21+ 1) 
possible values of L,, since there are (21+ 1) possible values of the magnetic 
quantum number m t . 

In the same way, by transforming to spherical polar coordinates, it may 
be shown that the operator representing the square of the total angular 
momentum, 

L 2 = 4 + 4 + 4, (2.141) 

satisfies the equation 

L 2 ip = ft 2 /(/+l).A, (2.142) 

again with \p given by (2,136 ). Thus, the angular momentum L has the 

constant magnitude hfl(l+\) in the state t p. For this reason / is often 
referred to as the angular momentum quantum number. 


2.9. The Average or Expectation Value of a Dynamical Variable 


If a large number of measurements of a dynamical variable / is made 
on a system in a state t p, the average or expectation value of/, which we 
will denote by </), for this state, may be defined as the mean of the measured 
values. Alternatively, the measurements may be made on a large number 
of identical systems all in the state »(/. 

From the interpretation of \ip\ 2 as a probability density, when \p is a 
normalized wave function, it follows, for example, that the expectation 
value of x for a state ip of a single particle is given by 


<x> 




(2.143) 


Both forms of the integrand are equally correct, but we prefer the latter 
for a reason which will appear below. The same applies to any function 
g of the coordinates only of any system: we have 
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<£> =JV*g<A dt, (2.144) 

the integration being throughout the configuration space of the system. 

As a simple example, let us consider the radial distance r of the electron 
in the ground state of the hydrogen atom. It may be verified that the nor¬ 
malized wave function is 




V(™o) 


>-r/a 0 


(2.145) 


that is, the function (2.48) with Z = 1 and normalized according to (2.98). 
Thus, 


<r> = j ip*r\pdr = —f re 2r/, °°d r 
J nalJ 

= 4pe- 2r/a °d r = in, 

O'Q J 0 


(2.146) 


According to the early Bohr theory of the hydrogen atom, the electron 
in the ground state was supposed to describe a circular orbit of radius 
a 0 , so that r was equal to a 0 (see also Problem 2.3, p. 355). 

Now let us consider what happens when the dynamical variable is a 
function of the momenta as well as the positional coordinates of the particles 
of a system. In this case the variable is represented by a differential operator. 
An obvious example is provided by the Hamiltonian or energy function. 
If the energy of a system in the state ip is E, then 


Hip = Eip, (2.147) 

and we know that the expectation value of H is E, since the energy is constant. 
This result follows, however, from an equation of the same form as (2.144); 
thus, 

<H} = j ip*Hipdx, (2.148) 

which is equal to E because of equation (2.147) and the normalization of ip. 
The H in the integrand is, of course, a differential operator. It is not difficult 
to see that a similar result must hold for any quantity which is a constant 
of the motion. 

The foregoing suggests that the expectation value of any dynamical 
variable should be given by an equation like (2.144), whether the operator 
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which appears in the integrand is a function of position only or a dif¬ 
ferential operator. We will not attempt to prove this, but will take it as a 
further fundamental postulate of quantum mechanics, to be justified by 
results. The formal statement of this postulate is as follows: 

the expectation value of a dynamical variable /, represented by the operator 
fop > for a state whose normalized wave function is i {/, is given by 

</> =jV/o P </' d T, (2.149) 

the integration being throughout the configuration space of the system. 
Here, to remove all ambiguity, we have denoted the operator by/ op , but 
it is customary to use precisely the same symbol to denote both the dynamical 
variable and the operator which represents it — as, indeed, we have already 
done in several places. 

For example, let us take the x-component of the momentum of a particle 
in the state i j/. We have, in general, 

<P X > = JV* y ^ dr. (2.150) 


Now, more specifically, in the case of the ground state of a particle in a 
one-dimensional box, whose normalized wave function, obtained from 
equation (1.38), is 


<A = 



5 


within the box, we find 


<P X > = 


2nh C L . nx nx , 

-7- sin — cos — dx = 0, 
L 2 i J 0 L L 


(2.151) 


Thus, the expectation value of the momentum is zero for this state, in 
agreement with classical mechanics, according to which the particle would 
move to and fro between the walls with constant speed. The same is true 
for all states of this system. 


2.10. The Uncertainty Principle 

Another principle which emphasizes the distinction between classical 
and quantum mechanics is the uncertainty principle of Heisenberg. This 
states essentially that it is impossible to specify simultaneously, with 
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unlimited accuracy, both the position and the momentum of a particle. More 
expressly, if Ax is the uncertainty in our knowledge of the x-coordinate 
of a particle, and if Ap x is the uncertainty in our simultaneous knowledge 
of the conjugate momentum p x , then 

Ax ■ Ap x > h (2.152) 

— in other words, the product of the uncertainties in x and p x must be 
greater than a quantity of the order of magnitude of h. There are two 
further uncertainty relations, like (2.152), for the other coordinates and 
conjugate momenta of the particle — also, in § 11.5, we shall consider yet 
another such relation, involving energy and time. 

The uncertainty relation (2.152) can be derived from the principles 
adopted in the foregoing sections, and incidentally it can be made more 
precise, if we define Ax and Ap x more exactly. Several definitions are pos¬ 
sible, but generally the uncertainty in a variable is taken to be the root- 
mean-square deviation from the mean, the latter being the expectation value 
defined in the previous section. Thus, 

{Ax) 2 = <(x-<x» 2 > 

= <(x 2 — 2x<x> + <x> 2 > = <x 2 > <x> 2 , (2.153) 

since <x<x» = <x> 2 . 

Similarly, 

{Ap x y = <{p x -<p x » z > = <p 2 x y-< Px > 2 . ( 2 . 154 ) 

For simplicity, we will restrict ourselves to a one-dimensional system, 
but the generalization is immediate. For a state whose normalized wave 
function is if/ we have 

<x) = J \p*xij/dx, 

<p x y = t f •a* — dx, 

1 J dx 

the integrals being over all values of x. 

Now, the inequality 

dif/ 2 

— -b(v4x 4- B + iC)ij/ ^ 0 
dx 


<x 2 > = 


il/*x 2 \pdx, 


<p x y = -ti 1 »a* f dx, 

dx 


d 2 ip 


(2.155) 


(2.156) 
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is obviously true, since the squared modulus of a complex number cannot 
be negative. We will take A, B and C to be real constants, to which we will 
ascribe values later. We may then write (2.156) in the form 

“d ij/* 


[“T + ( Ax+B ~' lC )'l / * —■ +{ Ax +B + iC)\1/ 


di//* di J/ 


dx 


f x + ( Ax+B ){ 


*&+*•**) +iC L*r 

dx d x/ \ dx 




+ A 2 x 2 if/*if/+(B 2 + C 2 )il/*i}/ + 2ABxiJ/*i{/ ^ 0 . 

Integrating over all values of x, we obtain 

f#*ch/^ , „ f„ dr 

J dx dx 


(2.157) 


d X + A\ x ^^.+^^) dx 


dx 

dij/* 


dx 




dx 


+ A 2 j \j/*x 2 \p dx + (B 2 + C 2 )j if/*if/dx 
+2AB \j/*x\j/dx ^ 0. 


(2.158) 

Elementary integration by parts, using the fact that i j/ vanishes at both 
limits of integration, yields the following expressions: 


•A 


dxjj* 

dx 


dx 


d V 

dx 2 

dx, 

(2.159) 

d<A , 

— dx, 
dx 

(2.160) 

-J 

\J/*\J/dx = — 1. 

(2.161) 


Substituting in (2.158), and using equations (2.155), we find 

fl ~ 2 <pl>-A + 2Ch~ 1 (p x ')+A 2 <x 2 ) + B 2 + C 2 + 2AB(x> ^ 0. (2.162) 

This is true for any real values of A, B and C. Let us therefore set 
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A = - B = 

2 (Ax) 2 2 (Ax) 2 



<P X > 


h 


Then (2.162) becomes 



or 



from which it follows that 


Ax • Ap x ^ h/2. 


(2.163) 


Thus, the more accurately we measure x, the less accurately can we 
simultaneously measure p x . In particular, if we measure x precisely, we can 
obtain no information about the simultaneous value of p x , and vice versa. 
It must be realized that this is not due merely to the crudity of our present 
experimental apparatus and techniques. It is now generally accepted that 
the uncertainty principle is a fundamental law of quantum mechanics, 
and is due to the necessary and unavoidable interaction between observer 
and observed whenever an observation is made. Indeed, we have derived 
(2.163) from the interpretive postulates of quantum mechanics, without 
reference to experimental techniques, so that a denial of the necessity of 
the uncertainty principle would entail a rejection of these postulates. 
None the less, there are those — and Einstein was foremost among them 
— who are not convinced that the uncertainty relations impose an essential 
limitation upon our detailed knowledge of a single system. 

Although the uncertainty relations apply to macroscopic bodies as well 
as to elementary particles, it is clear that they are only of importance in 
atomic problems, owing to the small size of h. The degree of uncertainty 
involved in the simultaneous measurement of the position and momentum 
of a cannon ball, as a result of the uncertainty principle, is quite irrelevant 
to even the most sensitive experiment in ballistics. 


2.11. The Helium Atom 

We will end this chapter with an illustration of some of the difficulties 
which arise when a system contains several interacting particles. The 
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simplest atomic system of this kind is the helium atom, which has two 
electrons. Admittedly, the hydrogen atom consists of two interacting 
particles — a proton and an electron — but we saw earlier that in this 
case the problem can be reduced to that of a single particle moving in a 
central field of force — in fact, it was a sufficiently 
good approximation to take the proton as fixed. 

In the present case it is an even better approximation 
to assume that the nucleus is fixed, for the nucleus 
of a helium atom consists of two protons and two 
neutrons, and so has a mass four times that of the 
hydrogen nucleus. We will therefore consider the 
problem of two electrons, each of charge -e and 
mass m, moving in the field of a fixed nucleus of 
charge 2e. 

As shown in Figure 2.5, we denote by r x and r 2 the position vectors of 
the two electrons with respect to the nucleus, and by r 12 the distance between 
them. The total potential energy of the system is then 



Fig. 2.5. The helium 
atom: a nucleus, of 
charge 2e , and two 
electrons. 


H r i> r z) 


l£_ _ 2 ^ e 2 

r l r 2 r 12 


(2.163) 


the first two terms being the potential energies of the electrons in the field 
of the nucleus, and the last term the potential energy of the electrons due 
to the repulsive force between them — the zero of potential energy is 
thus taken to be when the electrons are infinitely separated from each 
other and from the nucleus. From equation (2.22) we then obtain the 
Schrodinger equation 


2m 




2e 2 


r i 


2* £ 

r i r l2 \ 




(2.164) 


where E is the total energy. The wave function \J/(r 1 ,r 2 ) is, of course, a 
function of the positional coordinates of both electrons. 

Equation (2.164) is difficult to solve, and indeed cannot be solved analytical¬ 
ly, owing to the presence of the term e 2 /r 12 in the potential energy function. 
Let us, as an approximation, drop this term — in other words, let us assume 
that the electrons do not interact. Equation (2.164) can then be separated 
into two equations of hydrogenic form. We may write 


^x,r 2 ) = iA fl (i' 1 )iA 6 (r 2 ), 


(2.165) 
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and 


E = s a + s b , 


(2.166) 


where 



2m r t 


(2.167) 


and 



2m r 2 


(2.168) 


This is easily verified, because equation (2.167) multiplied by \J/ b> added 
to equation (2.168) multiplied by < J/ a , gives equation (2.164) with the term 
e 2 /r 12 omitted. Both equations (2.167) and (2.168) are of the form of the 
Schrodinger equation for a hydrogenic atom, with Z = 2, so that the energy 
levels e a and s b for each electron separately are given by equation (2.95). 
The energy levels for the atom are obtained by summing these one-electron 
levels, as shown in equation (2.166). In particular, the lowest energy level 
of the atom is just twice the ground state energy of a single electron, that is, 


E = —4 me^jh 2 . 


(2.169) 


This amounts to —108.8 eV, whereas the experimental value is —79.0 eV. 
The error, of about 40 per cent, is due to the neglect of the Coulomb inter¬ 
action between the electrons, and it is therefore clear that this is not per¬ 
missible. Unfortunately, if this interaction is included, equation (2.164) 
cannot be separated. This difficulty persists in all problems involving 
systems of more than one electron. Approximate methods of solution must 
be employed, but better than that discussed above. We shall consider 
some of these in the following chapters. 


CHAPTER 3 


THE VARIATIONAL METHOD AND PERTURBATION 

THEORY 


3.1. The Variational Method 

Let us suppose that a system, which may consist of one or more particles, 
and whose Hamiltonian operator is H, has an infinite set of discrete energy 
levels, denoted by E t , and normalized wave functions >j/ t . Then 

H^i = E^i. (3.1) 

If we multiply each side of this equation by i j/f and integrate throughout 
the configuration space of the system, we obtain 

E t (3.2) 

since, owing to the normalization of t/^, 

J tf'iVidr = 1. (3.3) 

In particular, the lowest energy E u which is the energy of the ground 
state \{/ l , is given by 

Et =JV?H<Mt. (3.4) 

Now let us consider any function (f> of the positional coordinates of the 
system, which is not necessarily a wave function, but is single-valued and 
continuous, and is normalized, so that 

J 4>*4 dr = 1. (3.5) 

Then, the variation principle, which is the basis of the variational method, 
states that the integral 

E = \ 4>*H<j)dx (3.6) 
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is greater than or equal to the ground state energy E y of the system. Equality 
is only obtained if, in fact, (f> is the function i j/ y , so we may also say that 
the function 0 which minimizes the integral E is the ground state wave 
function of the system. 

The variation principle may be proved quite easily by expanding the 
function 0 in terms of the orthonormal set of functions t/q, assumed to be 
complete, as discussed in Appendix 2. No generality is lost by supposing 
that the functions i j/ t are orthogonal since, if any of these functions are 
degenerate, orthogonal linear combinations of them can always be con¬ 
structed by the method described in § 2.7.2. We thus have 

4> = I c • (3-7) 

i 

The a t are fixed constants, and, owing to the normalization of <fi and the 
orthonormality of the i/q, we find 

J = 11 a * a ij = 1, 

or 

= 1. (3.8) 

i 

Substitution of (3.7) in (3.6) gives 

E = Y,lL a j a i\ 'l / * H 'l / i dT 
» J J 

= 'Z'Z a * a i E i f 

i j * 

= Z a * a i E b (3.9) 

i 

owing to equation (3.1) and the orthonormality of the 
Now, if we subtract the ground state energy E y from both sides of equation 
(3.9), using equation (3.8), we obtain 

E-E y = 'Za*a i (E i -E 1 ). (3.10) 

i 

Since E x is the lowest energy of the system, E t is greater than or equal to 
it for all i, and a*a t , or |cr,-| 2 , cannot be negative. The right-hand side of 
equation (3.10) is therefore positive or zero, so that 

E ^ E v 


(3.H) 
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Furthermore, E is equal to E t only if all the are zero except a, - in 
other words, only if 0 is the function i 

If the function 0 is not normalized, then the variation principle applies 
to the integral 


£ = J 0*H0dt 
J 0*0dr 


(3M2) 


instead of (3.6), since the denominator normalizes the functions 0 and 0* 
in the numerator. Clearly, if 0 is normalized, (3.12) reduces to (3.6). 

The variational method, based upon a general method devised by Ritz 
in 1909, makes use of the variation principle to find approximations to 
the ground state energy and wave function of an atomic system. The function 
0, generally called the trial function, is chosen to be of an analytical form 
which is likely to be close to that of the true ground state wave function 0 l5 
but containing several undetermined parameters. The integral E, in equation 
(3.12), is evaluated in terms of these parameters, and their values are then 
chosen so as to minimize E. This minimum value will be the best ap¬ 
proximation to the ground state energy £, for the analytical form of 0 
which has been used. 

For example, if the chosen form of 0 is 


0 = e ar (l +pr+yr 2 ), (3.13) 

for a one-particle system, and this is substituted in equation (3.12), E is 
obtained as a function of the parameters a, ft and y. E is now minimized 
with respect to these parameters — in other words, those values of a, p 
and y are chosen which satisfy the equations 


dE dE _ 5E 
da d/3 dy 


(3.14) 


How this works in practice will be seen in the following sections. 

Of course, the analytical form of 0 may be chosen arbitrarily, without 
reference to the likely form of i(/ 1 , but then E will probably be a poor 
approximation to E l . To obtain the best results from the variational method 
it is desirable that the form of 0 be as close as possible to that of Where 
this is difficult to assess with sufficient accuracy, several different forms of 
0 may be tried — the one which gives the lowest value of E may be taken 
to be the best as far as the evaluation of the energy is concerned. 
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3.2. The Ground State of the Harmonic Oscillator 

We have already determined the ground state of the harmonic oscillator 
in § 1.8. However, in order to illustrate the use of the variational method, 
let us pretend to be ignorant of this fact. 

The Hamiltonian operator for this system [see equation (1.53)] is 


T h 2 d 2 , 2 2 

H =-- +{-mco 2 x. 


(3.15) 


Now, the potential function is symmetrical about the origin, so we expect 
the ground state wave function to be symmetrical about the origin. Also 
we expect this wave function to have no nodes and to tend to zero as x 
tends to + oo. The simplest function with these properties, which easily 
comes to mind, is 


(3.16) 


<p(x) = e““ 


where a is a positive constant, which we leave as our adjustable parameter. 
We have, then, 



(3.17) 


(see Appendix 4 for some useful integrals). 
Also, 



so that 




(3.18) 


since integration by parts gives 



(3.19) 
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Thus, from equations (3.17) and (3.18), 

E (a) = I _ mco 2 | ii 2 a 


J <f)*<pdx 8a 2m ’ 


(3.20) 
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To minimize this we choose a such that 


d£ 

da 



(3.21) 


This gives a = mco/2h, and the minimum value of E is hco/2. We have seen 
in § 1.8 that, with this value of a, the function <j> in equation (3.16) is, in 
fact, the true ground state wave function, and hw/2 is the ground state 
energy. 

In this case, then, owing to our fortunate choice of <f>, the variational 
method has provided the correct solution of the Schrodinger equation. 
This very rarely happens, because only a few systems have ground state 
wave functions of such simple analytical form that they can be guessed 
immediately. It should be noted also that the variational method itself 
does not tell us how close the value of E is to the lowest energy level. In the 
present instance, only by previous knowledge, or by substitution in the 
Schrodinger equation, can we deduce that we have obtained the true 
ground state wave function — as far as the variational method goes there 
might well have been other forms of <f) which would give lower values of E. 

3.3. The Ground State of the Helium Atom 

As a more useful example let us consider the ground state, or normal 
state, of the helium atom, which has already been discussed in § 2.11. The 
Hamiltonian operator for this system is 



(3.22) 


2 m 


which gives rise to the Schrodinger equation (2.164). 

Before proceeding with the problem it will prove convenient to introduce 
a new system of units, which will be used frequently throughout the book. 
In this system the unit of energy is the rydberg (ryd) and the unit of .length 
is the Bohr unit (B.u.), defined as follows: 


1 rydberg = me 4 /2h 2 = 13.60 eV, 

1 Bohr unit = h 2 /me 2 = 0.5292 A. 
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For brevity we shall refer to these as atomic units, although this name is 
generally applied to another system, introduced by Hartree, in which the 
unit of energy is two rydbergs. It may be seen that one rydberg is the energy 
of the ground state of a hydrogen atom, calculated on the assumption 
that the nucleus is fixed. One Bohr unit is the length we have previously 
denoted by a 0 , and is sometimes called the ‘Bohr radius’. 

If H, r u r 2 and r 12 are now expressed in terms of these units, we have, 
from equation (3.22), 



or 


H = -Vf-Vf- — - — + — . (3.23) 

r i r 2 r 12 

We have seen previously that, if the Coulomb interaction between the 
electrons, that is, the term 2/r 12 in equation (3.23), is ignored, the Schro- 
dinger equation is separable into two hydrogenic equations. The ground 
state wave function is then, in atomic units, 

0 = e~ 2{ri+r2) , (3.24) 

which is the product of two hydrogenic ground state functions. It has been 
remarked that the ground state energy, —8 ryd or —108.8 eV, given by this 
approximation is in error by some 40 per cent. However, let us see if we 
can improve on this by including the Coulomb interaction and using 
the variational method, with trial function 0 of the form, suggested by 
(3.24), 

0 = e“ a(ri+r2) . (3.25) 

Here a is a parameter, assumed real and positive, whose value we shall 
choose by minimizing the integral 

E ( a \ = 11 0*H0dr 1 dr 2 
II 0*0drjdr 2 

Since 0 is a real function, of course, 0* is the same as 0. 

Now, we have 


(3.26) 
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f 0 O /•« 

= e-^MjrrJdrj 
•'o J 0 


e 2ar2 47ir 2 dr 2 


= 16ti 2 2 2 


7l 


(2a) 3 (2a) 3 a 6 * 

For a spherically symmetrical function /(r) we may write 

V 2 f=—— (r 2 ^ 
r 2 dr l dr) 

from which it follows that 

V^e-^ 1 = ^a 2 — — j e -ari , 

with a similar expression for Vfe _ar2 . Hence, 

H<f>= - (« 2 - — j e -*fr,+r,)_ ^2_ ^ e -.(r, + „) 


(3-27) 


(3.28) 


= [- 


+ (-i-± + ±) 

\ r x r 2 r 12 / 


,-a(n+r 2 ) 


2j_ (2«- 4) ^ (2a—4) + _2_ 


2a + >-^ + 


r 12-1 


fc -a(n+r 2 ) 


(3.29) 


and 


JJ* (f)*H<l>dr i dr 

= j|r_ 2 a 2 + ^-e^)^^ 


+ 

>*1 ^2 


+ - 

rn-J 


e" 2a(rj+r2 ) d r ld r 2 . (3.30) 


We have 

IT i e ‘ 2a(ri+r2) dr 1 dr 2 = f°°l 

JJ r x Jo r, 


e 2 ® ri 47rr 2 dr 1 J* e" 2ar2 47tr|dr 2 
1 2 7T 2 


— 1671 2 


(2a) 2 (2a) 3 a 


5 ’ 


(3.31) 
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and it is shown in Appendix 4 that 


If; 


— e -2 *( ri+,2) dr 1 dr 2 = 

8a 5 


12 


Using these two integrals in (3.30), we obtain 


If 


571 2 


and substituting (3.33) and (3.27) in (3.26) gives 

E( a) = 2a 2 — — a. 


(3.32) 


(f>*H<f>d ri dr 2 = - 2a 2 - + 2(2a-4)- +2 — , (3.33) 

.0 „5 g a 5 


(3.34) 


To minimize this we set 


giving 


dE 27 . 

— = 4a-= 0, 

da 4 


27 

a = — , 

16 


and the minimum value of E for the chosen form of <fi is 




729 

128 


= —5.69 ryd. 


(3.35) 


(3.36) 


(3.37) 


The observed value of the ground state energy is —5.81 ryd, so that 
the calculated value is in error by only 0.12 ryd, or about 2 per cent. Even 
with quite a crude trial function, therefore, the method gives an excellent 
value for the energy. With more complicated forms of <j>, involving several 
parameters, the value of the ground state energy of helium calculated by 
the variational method has, in fact, been brought into almost perfect 
agreement with the experimental value. 


3.4. Linear Variation Functions 

A frequently useful form of trial function (ft is one which consists of a 
linear combination of definite functions, with coefficients which are treated 
as variational parameters. Let us consider just two such functions, 4> t 
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and <p 2 , so that 

0=ai0i+a 2 0 2 - (3.38) 

Also, for simplicity, let us assume that <f> lf 0 2 and the parameters a u a 2 
are all real, although the final result is the same if this is not so. It should 
be noted that and 0 2 need not be orthogonal or normalized. 

According to the variational method, then, we wish to choose a^ and a 2 
so as to minimize the function 

E(a t , .J = 

J <P dt 

= a i I <f>iH<f> l dx + a 1 a 2 J ^H^dT-Ha^ J 0 2 H cp^r + aj f <j) 2 H<f) 2 dr 
a i J <f>\dr + 2a i a 2 J 0 1 0 2 dt + a 2 J 0 2 dr 

(3.39) 

In order to simplify this equation we define the following symbols: 


H ij = J <PiH<pjdx, S u = J <pi<pjdr, 

(3.40) 

where i and j can have the values 1 and 2. It is obvious that 

*^12 — *$21 > 

(3.41) 

and it is easily shown that 


Hu =H 2l . 

(3.42) 

Thus, consider a single-particle system, whose Hamiltonian 

is 

H = - — V 2 + F(r). 

2m 

(3.43) 

Then, 


H 12 = <}> 1 V 2 <f> 2 dr + f faVfadr. 

2m J J 

' (3 44) 

If the subscripts 1 and 2 are interchanged in the last term, its value is clearly 
unaltered, since V is a function of the coordinates only. Also, it follows 
from the work of § 2.7.1, involving the use of Green’s theorem, that, provided 
0i and 0 2 satisfy the same boundary conditions as a wave function, 

jWiVV*— &VV,)dr =0. 

(3.45) 
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This proves equation (3.42) for a single-particle system, and the same is 
true for any number of particles. 

Equation (3.39) may now be written in the form 


(ft i -(- 2n 2 ci 2 S 12 "b Q 2 S 22 )E — tt iH 11 -b 2n 1 q 2 H 12 -H q 2 H 22 . (3.46) 

In order to minimize E we must choose cq and a 2 so that 


“-a ^ = 0. 

dci2 


(3.47) 


Differentiating equation (3.46) partially with respect to a u and using 
the first of equations (3.47), we obtain 


(a 1 5 , 11 +a 2 S' 1 2)£: = aiH n +a 2 H 12 . (3.48) 

Similarly, differentiating (3.46) partially with respect to a 2 , and using the 
second of equations (3.47), we obtain 

{a i S l2 +a 2 S 22 )E = OiH 12 +a 2 H 22 , 

or, from equations (3.41) and (3.42), 

(fliS 2 i-\- a 2 S 22 )E = a 1 H 21 +a 2 H 22 . (3.49) 

We prefer the latter form because it leads to a result which agrees with 
that of the more general theory, which we will discuss later. 

Equations (3.48) and (3.49) may be re-arranged as follows: 

a^Hu-ES^+a^H^-ESu) = 0, (3.50) 

a i(H 2 i ES 2 i) + q 2 {H 22 — ES 22 ) = 0. (3.51) 

Now, aq and a 2 cannot both be zero, for this would make <p zero, which 
is not allowed. Therefore, a l and a 2 may be eliminated from equations 
(3.50) and (3.51), by elementary algebra, to give 

(H ll -ES 11 )(H 22 -ES 22 )-(H 12 -ES l2 )(H 2l -ES 2l ) = 0, (3.52) 

which is equivalent to the determinantal equation (see Appendix 3) 

Hi ! — ESi j H l2 — ES 12 

= 0. (3.53) 

h 21 -es 21 h 22 — es 22 

$1 and (f )2 are definite functions, containing no undetermined parameters, 
so that Hij and S u are numbers which can be calculated. Equation (3.52) 
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is therefore a simple quadratic in E, and, according to the variation principle, 
its lower root is an approximation to the lowest energy level E x of the 
system, being greater than or equal to it. It can also be shown that its 
higher root is an approximation, although generally not so good a one, 
to the next higher energy level E 2 of the system. 

In general, if a linear combination of N functions is used, so that 

N 

4> = (3.54) 

a set of N linear equations, each with N terms like those in (3.50) and 
(3.51), is obtained. Since the a± may not all be zero, the determinant of their 
coefficients in these equations must vanish, as in equation (3.53). The 
equation for E is, therefore, 


Hn-ESn H 12 ~ES 12 ... H 1N -ES 1N 
H 21 ES 21 H 22 ES 22 ... H 2 H — ES 2N 

Hni ~ ES N1 H N2 -ES N2 .. . H nn -ES nn 


= 0 . 


(3.55) 


This determinant is a polynomial of degree N in E, and the lowest root 
of the equation is an approximation to the lowest energy level of the system. 
The other roots again are, generally less good, approximations to the next 
N— 1 energy levels. 

The proof of this general case, when the a t and 0, may be complex, is 
only slightly different from that given in the particular case considered 
above, and we will omit it. One point should be mentioned, however. In 
general, we define 

Hij = J <f>*H<t>jdT, S tJ = J 4>t4>jdr. (3.56) 

It can be seen immediately that here S u is not necessarily equal to S jh but 

Stj = Sj • (3.57) 

Furthermore, it can be shown by the method used to prove equation 
(3.42) that 

Hij = H* . (3.58) 


Owing to this property H is said to be a Hermitian operator. 

In this general case the element H 21 -ES 21 , for example, in the deter- 
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minant of equation (3.55), may not be written H 12 —ES 12 , which is 
permissible in equation (3.53). 

3.5. Perturbation Theory 

Another important approximational method is known as perturbation 
theory, because it is used mainly to calculate the change in energy of an 
atomic system when the system is perturbed, that is, when it is acted upon 
by some small external force, called a perturbation. For example, pertur¬ 
bation theory may be used to calculate the change in energy of a hydrogen 
atom when a small electric or magnetic field is switched on. The original 
state of the atom is called the unperturbed state, and the final state, in the 
presence of the field, is called the perturbed state. 

On the other hand, the unperturbed state may be purely fictitious. We 
may require to calculate the energy of a system whose Hamiltonian contains 
a small term which makes this very difficult. If we can calculate the energy 
levels and wave functions which the system would have without this awkward 
term, then perturbation theory can be used to calculate the additional energy 
due to the presence of this term, which is regarded as a perturbation. For 
example, we saw in § 2.11 that the solution of the Schrodinger equation 
for the helium atom would be a very simple matter were it not for the 
presence of the term e 2 jr l2 in the Hamiltonian, for without this term the 
equation separates into two hydrogenic equations. We may therefore 
calculate the energy levels and wave functions without this term, the Schro¬ 
dinger equation corresponding to no real system, and then use perturbation 
theory to obtain the corrections. The ground state of the helium atom will 
be considered in detail from this point of view below. 

Perturbation theory is mathematically rather complicated, and we will 
first of all consider only its most elementary form, called first-order pertur¬ 
bation theory, which will be sufficient for most of the purposes of the present 
book. A more general treatment will then be given, for the sake of its 
intrinsic interest and importance, but this may be omitted at a first reading, 
if desired. 

3.6. First-Order Perturbation Theory for a Non-Degenerate State 

Suppose that the Schrodinger equation for the state ¥' of some system is 

H'F = EV, (3.59) 
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and that the Hamiltonian H can be expressed as the sum of two parts, 

H = H°+H\ ( 3 . 60) 

the effect of H' on the energy levels and wave functions being small. Also 
assume that the equation 

H° \}/ = E 0 \p (3 61) 

has been solved, subject to the same boundary conditions as equation 
(3.59), and that the wave function i J/ is non-degenerate. H° is the Hamiltonian 
of the unperturbed system, H that of the perturbed system, and H' is the 
perturbation term. 

In order to avoid a complexity of subscripts and superscripts, which 
sometimes occurs in the more general theory, we have denoted the un¬ 
perturbed state by a small i {/ and the perturbed state by a capital f' — instead 
of considering all the states of the system, we focus our attention upon one 
of them, and assume that the state is that into which ^ is changed by the 
perturbation. 

If the perturbation is truly small, then V will not differ very much from i J/. 
According to first-order perturbation theory an approximation to the 
change in energy is obtained by substituting \j/ for W in the integral ex¬ 
pression for E; thus, approximately , 


E = l = J + dr 

J \j/*\j/dz J i/A^dt 


_ E o , J Wijfdr 


(3.62) 


from equations (3.60) and (3.61). In other words, the first-order perturbation 
energy, the second term on the right of equation (3.62), is obtained by 
averaging the perturbation term over the unperturbed state of the system. 
If i j/ is normalized, of course, the denominators in equation (3.62) reduce 
to unity. 

In some cases, owing to the symmetries of i p and H', the first-order 
energy correction vanishes. In this case it is necessary to go to second-order 
perturbation theory, which we shall discuss in § 3.8, in order to obtain a 
non-zero correction. 





74 THE VARIATIONAL METHOD AND PERTURBATION THEORY [Ch. 3, § 6 


3.6.1. THE GROUND STATE OF THE HELIUM ATOM 

The Hamiltonian of the helium atom, in atomic units, is given in equation 
(3.23). We will regard the Coulomb interaction term, 2 /r 12 , as a pertur¬ 
bation, so that, in the above notation, 

H° = — Vf — V|— — - —, 
r i r 2 

jr-A. 

ru 

As shown in §2.11, the Schrodinger equation for the unperturbed state, 

V = E° i/q (3.65) 


(3.63) 

(3.64) 


separates into two hydrogenic equations, and the unperturbed ground state 
wave function, as given in equation (3.24) in our present units, is 

\}/ = e- 2(ri+rj) . (3.66) 

The unperturbed ground state energy, obtained from equation (2.169), is 

E° = - 8 ryd. (3.67) 

The ground state energy of the perturbed system, according to first- 
order perturbation theory, is 


£ = jj i^*Hi/rdr 1 dr 2 

If ^t/'diqdrj 

= e°+ U t A 2 ( 2 / r i2)dy t dr 2 
H ^ 1 dr 1 dr 2 


(3.68) 


since t {/ is real. There is no need for us to calculate this, however, for the 
work has already been done in the application of the variational method 
to this system. The unperturbed wave function (3.66) has the same form 
as the trial function (f) in equation (3.25), with a = 2 — indeed, this is 
what suggested the form of the trial function. Thus, comparing equations 
(3.26) and (3.68), we see that all we have to do to obtain E in our present 
problem is to put a = 2 in the final expression for E{ a), equation (3.34). 
This gives 


E = 2(2) 2 —■ s ^-(2) = —5.5 ryd. 


(3.69) 
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This is quite a good approximation to the observed value, -5.81 ryd; 
the error being about 5 per cent. It is not so good as the result of the varia¬ 
tional method, however; which was to be expected, for ^ has the same 
form as </> but not the optimum value of a, namely, 27/16. 

The first-order energy correction is 

E—E° = —5.5 + 8 = 2.5 ryd. (3.70) 

This is by no means small — in other words, the Coulomb interaction is 
hardly a small perturbation — so it is not surprising that first-order pertur¬ 
bation theory does not give a more accurate result. 

3.7. First-Order Perturbation Theory for a Degenerate State 

Using the same notation as in § 3.6, let us now assume that the un¬ 
perturbed state \p is doubly degenerate ; that is to say, there are two in¬ 
dependent wave functions i \jt u \j/ 2 corresponding to the same unperturbed 
energy E°, so that 

H °ti = £°</T> (3.71) 

and 

H °'l' 2 = E°\J/ 2 . (3.72) 

In order to simplify the algebra slightly we will assume that \J/ 1 and \j/ 2 
are orthogonal and normalized. No generality is lost in this way, because 
we have seen in § 2.7.2 that, if the degenerate wave functions are not at 
first orthogonal, linear combinations of them which are orthogonal can 
always be constructed. 

Now, any function of the form 

iA = a^i+^iAj, (3.73) 

where a t and a 2 are constants, is also a wave function of the unperturbed 
system corresponding to the energy E° — clearly, by multiplying equations 
(3.71) and (3.72) by a x and a 2 respectively, and adding, we obtain 

H°Mi+a 2 il/ 2 ) = E^a^y+a^J. (3.74) 

In the non-degenerate case we obtained an approximation to the energy 
E of the perturbed system by assuming that its wave function 9 was the 
same as that of the unperturbed system. Let us do the same in the present 
case — in other words, let us assume that the perturbed wave function 
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is given as a first approximation by (3.73), the values of a, and a 2 remaining 
undetermined for the present. 

We will denote the perturbation energy by E', so that 

E = E° + E’. (3.75) 

Then, as a first approximation, we assume that if/ satisfies the equation 

H\j/ = E\j/, 


or 

(H° + H')(a 1 il/ 1 +a 2 il/ 2 ) = (E° + E')(a 1 xl/ 1 +a 2 i// 2 ). (3.76) 


Using equation (3.74), this reduces to 

(H'-E’Xarfi+aM = 0. (3.77) 

We now multiply this equation by ij/ 1 and integrate over the configura- 
tion space of the system, and then do the same thing with \p 2 , obtaining 
the two equations 

J + dx = 0, (3.78) 

J ^(H’-E'Xa^. + a^^ dx = 0. (3.79) 

Notice that \j/* and \{/ 2 are kept to the left of H', which might be a dif¬ 
ferential operator. 

As in § 3.4, we define the following symbols: 

H'ij = jipfH'if/jdx, (3.80) 


in terms of which equations (3.78) and (3.79) become 

a 1 H' 11 +a 2 H , 12 — E'd! ji(/*\l/ 1 dx-E'a 2 |* \j/*il/ 2 dx = 0, (3.81) 

a iH' 2l +a 2 H 22 -E , a 1 J i]/l\]/ 1 dx-E'a 2 J i fy 2 dx = 0. (3.82) 

Owing to the orthonormality of i J/ l and t/^ 2 these equations reduce to 


+ = 0, (3.83) 

a \H 2 \d-a 2 {H 22 — E ) = 0. (3.84) 
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One solution would be a, = a 2 = 0, but this is not permitted as it would 
make ijr identically zero. In order that the same non-zero values of a t 

and a 2 may satisfy both equations, the determinant of their coefficients 
must vanish; that is, 


ff'u-tf n \ 2 
H' 2 i H’ 22 -E' 


The expansion of the determinant gives 


(3.85) 


(H'ii-E')(H 22 -E')-H' 12 H 21 = 0, (3.86) 

which may be verified by eliminating a x and a 2 from (3.83) and (3.84) 
by elementary algebra. 

The determinant in equation(3.85) is generally called the secular determinant 
for the perturbation, and the equation is called the secular equation. The 
latter is a quadratic in E' and will usually have two distinct roots. In this 
case the perturbation has split the doubly degenerate unperturbed level into 
two different levels, and we say that it has removed the degeneracy. On 
the other hand, the secular equation might have equal roots, in which case 
the perturbation has merely displaced the unperturbed level, and we say 
that the degeneracy is not removed to the first order by the perturbation. 

It is generally not required to find the values of a± and a 2 , but if necessary 
the two values of afa 2 corresponding to the two values of E' can be obtained 
directly from either of equations (3.83) and (3.84). The values of a x and a 2 
are not fixed separately by these equations, so that iff still remains ar¬ 
bitrary to the extent of a multiplying constant — which, of course, may 
be fixed by normalization. 

It is useful to notice that 


H'n-E' =JV?(ff-ff°>MT-E' 

= H n —£° —£' = H u -E, (3.87) 

and 

ff'12 =J iA?(tf-tf°)<M? 

= H 12 -E°j^ 2 dr = H 12 , (3.88) 

owing to the orthonormality of ij/ 1 and 1 j / 2 , with similar equations for the 
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other elements of the secular determinant. Thus, the secular equation may 
equally well be written 


H tl -E H l2 
H 2 1 H 22 — E 


(3.89) 


This is true in general: the secular determinant has the same form whether 
it is written in terms of the Hamiltonian H and energy E of the perturbed 
system, or in terms of the perturbation H' and energy correction E'. 

So far we have considered only a doubly degenerate unperturbed level, 
but the method is the same for an A-fold degenerate level. In this case we 
take the wave function to be 

* = Z a* , (3.90) 

i = 1 

where the t j/ t constitute an orthonormal set of degenerate functions cor¬ 
responding to the unperturbed energy E°, and the secular equation turns 
out to be 


H'u ~E' 

H \ 2 . 

• . H[ n 



H 2i 

H' 22 -E '. 

• • H' 2N 

= 0 . 

(3.91) 

Hi, i 

H ' N2 . 

. . H' nn -E' 




We will leave the proof of this until § 3.8. This equation is of the iVth 
degree in E' and has N roots which may not all be distinct, so that the 
degeneracy may not be completely removed to the first order. 

As in the doubly degenerate case, the secular equation has the same 


terms of 

H and E; 

thus, 

H^-E 

H l2 .. 

• h in 

H 2 1 

h 22 -e . . 

• H 2N 

h n1 

h N2 .. 

• H nn — E 


= 0 . 


(3.92) 


The similarity of this equation to equation (3.55), which arose out of the 
method of linear variation functions, should be noted. If the variation 
functions (f> t had been chosen to be an orthonormal set, then (3.55) would 
reduce exactly to (3.92). Similarly, if the degenerate functions i/q of our 
present method had not been orthonormal, the secular equation would have 
had the form of equation (3.55). 
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It should be remembered, however, that the functions i fo of the pertur¬ 
bation method are not arbitrarily chosen variation functions, but are wave 
functions of the unperturbed system. Furthermore, the method of linear 
variation functions, as we have given it, applies only to the lower energy 
levels, whereas perturbation theory applies to all states of the system. 
None the less, there is a close connection between the variational method 
and perturbation theory, which will be brought out more clearly in our 
general treatment of the latter subject. 


3.7.1. PARTICLE IN A BOX, SUBJECT TO A PERTURBING FIELD 

As an example of the application of first-order perturbation theory to a 
degenerate level, let us consider the particle in a box, discussed in § 2.3, 
when the system is subjected to a perturbing field in which the potential 
energy of the particle is Ax 2 , where A is a small constant. The particle 
might be an electron, for instance, and the perturbing field an electrostatic 
one, proportional to * and in the ^-direction (we will not consider 

how such a field might be produced — the problem is a purely artificial 
one). 

We will discuss specifically the first triply degenerate state, with un¬ 
perturbed energy 


E° 


6h 2 n 2 
2 mil 


(3.93) 


obtained from equation (2.39), and quantum numbers (n x , n y , n z ) which 
can have the values (2, 1, 1), (1, 2,1) and (1, 1, 2). The three unperturbed 
wave functions, from equation (2.38), are 



These functions are normalized, and it can easily be verified that they are 
orthogonal; thus, 
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Jo Jo Jo 
8 r L . 2nx 

= — si 

L 3 Jo 


. nx , 

sin-sin — dx 

L L 


,L . % V . 271V 

sin — sm- 

o L L 


d y 




= 0, (3.94) 

and similarly for the other pairs of functions. 

In the notation used above, the Hamiltonian of the unperturbed state, 
within the box, is 


H° = 



(3.95) 


and the perturbation term is 


H' = Ax 2 . 


(3.96) 


We are not concerned with what happens outside the box, since the potential 
energy is infinite there and the wave functions are zero. 

The secular equation is, then, 


ii -E' 

h\ 2 




H 21 

h 22 -e' 

h' 23 

= 0, 

(3.97) 

H 31 

H' 32 

H 33 -E' 




where E' is, as usual, the first-order energy correction. We must now 
evaluate the quantities H' lJy where 



(3.98) 


We have 


H[ 2 = H' 2i 


8 A 
L 3 



. 2nx . nx , C L . ny . 2ny 
sm — sin — dx I sin — sm — 

L L Jo L L 



= 0 


because 


(3.99) 
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and, similarly, 

#'l3 = #31 = #23 = #32 = 0. (3.100) 

Thus, all the off-diagonal elements of the secular determinant — those 
which do not contain E' — are zero, so that the secular equation reduces to 


(H'u~ E)(H 22 — E')(H' 33 — E') = 0, 


and its roots are EI 22 ^ -# 33 * 

Now, 


a ud 


H' - 8A 

H “~I> 


XL 2 


>L 2 • 2 2rcx f L . 2 ny A f L ■ 2 nz A 

x sin -dx sin - dy sin — dz 

0 L J 0 L J 0 L 

e-i)’ 


h’ 22 = m 


33 


_ 8A f 

" L 3 J, 


= ;.l 2 


‘ L 2 . 2 7IX , f L . 2 , f L . 2 712 j 

x sin — dx sin dy sin — dz 

0 E J q E J 0 E 

(1--L). 

\3 27i 2 / 


(3.101) 


(3.102) 


(3.103) 


since interchanging the y and z in the latter integral does not change its 
value. The integrations involved are elementary— integration by parts gives 


j; 


x sin 


27IX 

~L 


-2 dx = - - 


Z? 

7 




j: 


TZX 


x 2 sin 2 — dx = — —~~z , 
L 6 4tt 2 


and, in addition, we have 



We see, then, that two of the roots of the secular equation are equal, 
which means that the degeneracy is not completely removed to the first 
order. The triply degenerate unperturbed level E° is split by the perturbation 
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into two distinct levels 

e 2 


6h 2 n 2 
2 ml} 

6h 2 n 2 
2 mil 


+ XL 2 





(3.104) 

(3.105) 


the second of these remaining doubly degenerate. 

Although, as remarked above, this example is rather an artificial one, 
it brings out quite simply the main points of the preceding section. 


3.8. Generalized Perturbation Theory 

We will now show how second-order and higher-order perturbation 
corrections to the energy of a system may be obtained. Let us assume 
that we know all the wave functions \J/ t and energy levels Ef of the un¬ 
perturbed system, whose Hamiltonian, as before, is H° — that is to say, 
we know the eigenfunctions and eigenvalues of the equation 

H°iJ/ = (3.106) 

The problem is to find the eigenfunctions ^ and eigenvalues of the 
equation 

H¥ = EW. (3.107) 

where 

H = H° + H'. (3.108) 

Again, to avoid unnecessary superscripts, we have used a small \J/ for an 
unperturbed state and a capital for a perturbed state — here, however, 
it will be necessary to use subscripts to distinguish between different states 
of the same system. 

If we assume that all the energy levels E® of the unperturbed system 
are discrete, and that the wave functions i j/ t form a complete orthonormal 
set, then we may express a wave function V of the perturbed system exactly 
in the form 

OO 

^ = I <#/, (3.109) 

;=i 

by suitable choice of the coefficients a t . Substituting this expression in 
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equation (3.107), we obtain 

Z a i H 'l / i = E Z a i { l / i■ (3.110) 

i i 

Multiplication by i j/J, and integration over the configuration space of the 
system, now gives 

Z «,-J = E Z a,-J ^j^dx. (3.111) 


As before, we use the notation 

H tJ =jVf//<Z-dT, (3.112) 

and we may express the orthonormality of the functions i fa by the equation 

JVflM* = 3„, (3.113) 

where is a standard mathematical symbol, called the Kronecker delta, 
which has the value unity when / = j, and the value zero when i # j. Equa¬ 
tion (3.111) may then be written 

'Z“KHji-ES J d = 0. (3.114) 


There is an infinite number of such equations, corresponding to the 
infinity of integers j, and each equation has an infinite number of terms. 
We may write them out, incompletely, but a little more explicitly, thus 


i(// 1 j — E) + a 2 H l2 + a 3 H 12 + . . . = 0, 

a 21+ 22~a^H23+ . . . = 0 , 

a 1 H3 1 + a 2 H 32 + ci 3 (H 3 3 £)+ . . . = 0 , 


(3.115) 


Just as for a finite set of linear equations (see Appendix 3), in order 
that the a t may not all be zero, the determinant of their coefficients, this 
time of infinite order, must vanish. Hence 


H 


u 


H 

H 


-E 

H 12 

H 

21 

h 22 -e 

H 

31 

1^32 

h 33 


= 0 . 


(3.116) 
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We wish to emphasize that everything we have done up till now has been 
exact — no approximations have been introduced. Of course, there are 
mathematical difficulties connected with the convergence, and the evaluation, 
of infinite determinants, but we will ignore these. Equation (3.116), then, 
is an equation of infinite degree in E, and its infinite number of roots, if 
they could be found, would be the exact energy levels of the perturbed 
system. 

It should also be remarked that equation (3.116) is just what one would 
obtain by means of the method of linear variation functions, discussed in 
§ 3.4, with a trial function of the form given in equation (3.109). We see 
therefore that, ultimately, perturbation theory and the variational method 
are equivalent, and both may provide exact solutions of the problem — 
but with infinite labour! Of course, in both the variational method and 
in the method we have used, any complete orthonormal set of functions 
\j/ h not necessarily wave functions of the unperturbed system, would have 
led to equation (3.116). However, our derivation of the first-order and 
second-order perturbation energies, which we shall give below, depends 
upon our using the set of unperturbed wave functions. 

As in § 3.7, if 

E = E° + E', (3.117) 

we have 


Htj-Ed,, = H% + H' ii -E°8 u -E\5 lr 


= H'n — E'dn, 


since 


Hu = JVftfVy dt = £°J lAfiAydr = E°5 ij 


(3.118) 

(3.119) 


It follows that, as before, equation (3.116) has the same form whether it 
is written in terms of H' and E' or in terms of H and E — this, of course, 
would not be so if the \j/ t were not eigenfunctions of equation (3.106). 
None the less, we shall continue to work in terms of H and E. 

Another point which should be noticed is that, in deriving equation 
(3.116), no mention was made of the size of H' — the equation is correct, 
in fact, whether the effect of H' on IF is large or small. However, for the 
practical purpose of evaluating the energies, with reasonable accuracy and 
without excessive labour, it is important that H' should be a small pertur¬ 
bation, which means that the perturbed wave functions and energies do 
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not differ very much from the unperturbed ones — we shall assume this 
in the following work. 

Now, if the perturbation term were zero, so that the functions i j/ t were 
exact eigenfunctions of H, the off-diagonal elements of the determinant 
in equation (3.116) would vanish, for we should have, for i # j. 


Hij = 




The equation would then reduce to 


H ll —E 0 0 ... 

0 H 22 -E 0 

0 0 H 33 — E . . . 


= 0 , 


(3.120) 


(3.121) 


or 

{H xl -E){H 22 -E){H 22 -E)... =0, (3.122) 

with the roots H 1X , H 22 , H 33 ,... 

Although the perturbation term is not exactly zero, we are assuming 
that it is small, so that it is reasonable as a first approximation to neglect 
the off-diagonal elements and take the perturbed energy levels to be 

E x = H X1 , E 2 = H 22 , etc. (3.123) 

This agrees with what was said before about the first-order perturbation 
theory for a non-degenerate level. If any of the unperturbed levels are 
degenerate, however, equations (3.123) may not provide very useful cor¬ 
rections to these levels. The reason for this is that we have an infinite choice 
of orthonormal sets of wave functions for a degenerate level, and any such 
set may be used in deriving equation (3.116). However, not all of them 
will give equally small values of the off-diagonal elements of the determinant. 
Thus, although equations (3.123) will give the rough positions of the 
perturbed levels, even though the unperturbed levels are degenerate, any 
splitting of the levels, and this is generally of most interest, is likely to be 
very badly represented. To overcome this difficulty we leave in the off- 
diagonal elements which are derived from the degenerate functions only, 
so that there is no question of arbitrarily neglecting terms which might 
be important. 

For example, if the unperturbed wave functions \\]/ x and i j/ 2 are degenerate, 
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and all the rest are non-degenerate, so that £° — £ 2 , but all the other £ ; ° 
are different, we set all the off-diagonal elements equal to zero except 
H 12 and H 21 . Equation (3.116) then becomes 


H tl -E tf 12 0 ... 

//21 H22 E 0 . . . 

0 0 h 33 -e... 


(3.124) 


which factorizes into 


Hu-E H 12 
H21 Et 22 E 


(tf 33 -£)(H 44 -£)... =0. 


(3.125) 


Thus, the perturbed levels corresponding to the non-degenerate i/q are 

E 3 = i/33, E 4 = i/ 44 , etc., 


whereas the perturbed levels corresponding to the degenerate 1 /^ and 1 Jj 2 
are the roots of the equation 


i/ 11 — E H i 2 

i/21 h 22 -e 


(3.126) 


This is just the secular equation we obtained before, equation (3.89), for 
a doubly degenerate level. If the level is iV-fold degenerate — for example, 
if Ej = £ 2 = . . . = £$, so that the functions i 1 J / 2 ,. . . \j/ N are degenerate 
— it is clear that the secular equation (3.92) will be obtained instead. This 
result is quite general and applies to every degenerate level — we have 
chosen the \j/ t with the smallest subscripts as our degenerate wave functions, 
but this is merely for convenience in writing the equations; we have nowhere 
implied in this section that they must correspond to the lowest energy level. 

The same method will obviously be advantageous if a set of 1 j/ h although 
not degenerate, are almost so — in other words, their corresponding £ ; ° 
are very nearly equal. Again, a much better approximation to the per¬ 
turbed levels will be obtained by ‘leaving in’ the off-diagonal elements 
derived from the almost-degenerate functions. 

It is rarely necessary to go beyond first-order perturbation theory for a 
degenerate level, so let us return to non-degenerate levels and consider 
how to find a second-order perturbation correction to the energy. Suppose, 
for example, that £° is non-degenerate, and we wish to find the perturbed 
level £ x corresponding to it. We have seen above that a first approximation 
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to E x is Hu, so let us substitute H X1 for E in every diagonal element of 
the determinant in equation (3.116) except the first, which we now write 
H ll — E 1 . Also, let us retain all the off-diagonal elements which involve 
i p t , and put all the others equal to zero, since they affect E l only indirectly. 
The equation for E 1 is then 


11 Ei 

H 12 

H 13 H 14 

H 21 

h 22 -h ix 

0 0 

H 31 

0 

H 33 -H X1 0 

h 41 

0 

0 h 44 -h xi 


= 0. (3.127) 


To solve this equation we make use of the fact that the value of a deter¬ 
minant is unchanged if to any row is added a constant multiple of any 
other row. We first multiply the second row by H 12 /(H 22 — Hu), and then 
subtract it from the first. The result is that the element H 12 disappears 
from the first row, and the first element becomes 


Hn-E i- 


H 2 2-H n ’ 


all the other elements remaining unchanged. This process is now repeated 
with the third row — this is multiplied by H X3 /(H 33 -H X1 ), and sub¬ 
tracted from the first row. The result is now the disappearance of H 13 
and the further addition of • 


h 13 h 31 

H 33 — H ll 

to the first element. Each repetition removes an element from the first row 
and adds a term of the form -H lJ H jX l{H n -H ll ) to the first element. 
Eventually, then, all the off-diagonal elements of the first row are removed, 
and the first element becomes 




y H u H n 

h Hjj — Hn ‘ 


Expanding the determinant in terms of the first row then gives 


( 


H,i-£ 1 


£ (H 22 -H,,)(H3 3 -H,,) • • • = 0. 

j * 2 H j; — H 11 / 


(3.128) 


Since ij/^ is non-degenerate, none of the factors can be zero except the first. 
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We therefore find 

El = Hll - £ (3.129) 

j-2 — 

H 11 is the first-order approximation, and the second term on the right is the 
second-order correction. Now, we have 

= E°-E° 1 +H' Jj -H' 11 , 

and Hjj—H'u, the difference between two first-order corrections, should 
be very small compared with Ef—E°, the difference between two energy 
levels, so that it is a sufficiently good approximation to write 


E i 


#u- 


y #D#/i 
j=2 E 0 j-E\ ' 


(3.130) 


The same formula holds, of course, for any non-degenerate level. In 
general, the second-order approximation to the perturbed level E h cor¬ 
responding to the non-degenerate unperturbed level Ef, is 


E, = H tt - £ 


HijHj, 


J* 1 E°j — E? 


(3.131) 


Owing to the Hermitian property of H, equation (3.58), this equation may 
be written 


Also, since 


It/ 12 

£. = h u - y . 

rf. c-O 77O 

J*l L: — t: 


Hij =j<pf(H° + H , )il/ J dT 
= J lj/f ij/jdx + 

= fytj+H'tj, 


WH'tjdT 


(3.132) 


(3.133) 


equation (3.132) could equally well be written 


£, = £?+//;,- 1 


W 


IJ I 


j*iE°-E° 


(3.134) 


which gives the first- and second-order corrections to E° directly in terms 
of the perturbation H'. 
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Although it is possible to proceed in this way, and, by retaining further 
off-diagonal elements in equation (3.116), obtain better and better ap¬ 
proximations to the perturbed levels, this is rarely done. If the effects of the 
perturbing term are so great that even second-order perturbation theory 
fails to give a reasonable result, then a different approach to the problem 
is generally attempted. We shall have an example of this later, when we 
discuss the effects of the interaction of electrons in metals. Perhaps it 
should be pointed out, however, that new and powerful techniques of pertur¬ 
bation theory have recently been developed, which offer the possibility of 
solving the perturbation problem exactly, or sufficiently so for all practical 
purposes, by virtually summing the whole perturbation series, to infinite 
order. These techniques are much too complicated for inclusion in the 
present book — and, indeed, there is little reason for including them, since 
their impact upon the electron theory of metals has so far been slight. 

3.9. Matrix Elements 

Although we do not intend to give any account of matrix mechanics in 
this work, some of the terminology of this branch of quantum mechanics 
is regularly used even in work of an entirely wave-mechanical nature, so 
it is advisable that the reader be acquainted with it. 

The quantities H u , defined by 



(3.135) 


as before, are referred to as the matrix elements of the operator H with 
respect to the set of functions i j/ h and the matrix of H is the following array 
of these quantities: 



(3.136) 


It must be understood that this is not a determinant, which is why curved 
brackets, or sometimes square brackets, are used, instead of lines, to enclose 
the array. There are mathematical rules for the addition and multiplication 
of such matrices, just as there are for vectors — indeed, a vector can be 
represented by a particularly simple kind of matrix. Similar matrices may 
be set up for other operators, and matrix mechanics deals directly with 



90 THE VARIATIONAL METHOD AND PERTURBATION THEORY [Ch. 3, § 9 


these rather than with the operators and wave functions of wave mechanics. 

The matrix elements H liy H 22 , H 33 , etc., are called diagonal elements. 
If the i j/i are eigenfunctions of H, so that 

H\J/ t = E^ t , (3.137) 

then 

H u = Ej5 ijt (3.138) 

and the Hamiltonian, or energy, matrix (3.136) reduces to 


[E x 0 

/ 0 e 2 
1 0 0 



(3.139) 


which is called a diagonal matrix, and the diagonal elements are the energy 
levels of the system. For this reason the problem of finding the energy levels 
and wave functions is often referred to as the diagonalization of the 
energy matrix. 


CHAPTER 4 


THE MANY-ELECTRON PROBLEM AND THE 
ONE-ELECTRON APPROXIMATION 


4.1. The Many-Electron Atom 


Let us now consider the general case of an atom with nuclear ctiarge Ze, 
or atomic number Z, and N electrons. If N = Z, the atom is neutral, since 
its total charge is zero; but N may not equal Z, in which case the atom 
is ionized. We shall again treat the nucleus as a fixed centre of force, this 
being an even better approximation for the 
heavier atoms than it is for hydrogen and 
helium. 

Extending the notation used in the case of the 
helium atom, we denote by r, the position 
vector of the zth electron, and by the distance 
between the zth and y'th electrons. The potential Fi s- 4 - 1 - A many-electron 
energy of the zth electron in the field of the atom- 

nucleus is —Ze 2 /r { , and the mutual potential energy of the zth and jth 
electrons, due to their electrostatic or Coulomb interaction, is e 2 /r, v . Sum¬ 
ming the former over all the electrons, and the latter over all the pairs of 
electrons, the total potential energy of the system is found to be 



N Ze 2 l N N e 2 

V(r l , r2 ,...r s )=-Z— + 

i=i r t 2 i*j r u 


( 4 . 1 ) 


The second term is a double sum over all values of z and j, those terms 
in which z and j have the same value being excluded, since an electron does 
not repel itself. The factor \ arises because the double sum as it stands 
contains the interaction between any pair of electrons twice — e 2 /fy and 
e 2 lrji both equal the potential energy of interaction between the zth and /th 
electrons. It is easily verified that this term may equally well be written 
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the double sum now being over all values of j and over those values of i 
which are greater than j, so that no factor \ is required. It should be noted 
that often a single summation sign is used to represent double, or mul¬ 
tiple, sums. 

The zero of potential energy is here obtained when all the particles of 
the system are infinitely separated. Any magnetic interaction among the 
electrons, due to their motion, is neglected, as it is of very much smaller 
order of magnitude than the electrostatic interaction. 

The quantum-mechanical Hamiltonian operator, from equation (2.20), is 


N f 2 N -7 2 i N N „2 

1=1 2m i=i r t 2 r tj 

N / > 2 7 2\ i N N 2 

= + EX-. 

i=i\2m r { 1 2 i*j r u 

and the Schrodinger equation is 


(4.2) 


H'F = EW. (4.3) 

Here, and in future, we shall use a capital V for the wave function, 
x E{r l , r 2 . . .); of a system of several electrons, reserving the small i j/ for 
a function of the coordinates of a single particle. 

The main difficulty with the solution of equation (4.3) is the presence of 
the last term in H, the electronic interaction term. If this term were absent, 
we could write 

V = t/' 1 (r 1 )</' 2 (i‘ 2 ). . . i K(r N ), (4.4) 


and equation (4.3) would become 



*/'l*/'2 • • • l/fiv — ^2 


■ • •/'n- 


(4.5) 


+ 


Ze‘ 


Dividing through by W gives 

N 1 /ft 2 

and, as each term of the sum depends upon the coordinates of one particle 
only, this equation can be separated into the N equations 


= E, 


(4.6) 


-1*1 v? + Ze 

’2m 


^ ) tit = Zi'Pb 

n ' 


i = 1,2, ... N, 


(4.7) 
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where 

T,e t = E. (4.8) 

i = 1 

Each of these equations is of hydrogenic form and can therefore be solved 
analytically. Unfortunately, the electronic interaction term cannot be 
neglected in this way and it is impossible to obtain an analytical solution 
of equation (4.3), as we have already seen in the simplest case of helium. 

Of course, the interaction term could be treated by perturbation theory, 
using unperturbed functions of the form given in equation (4.4). However, 
the Coulomb interaction is by no means a small perturbation, so that 
first-order perturbation theory is not expected to give accurate results — 
it will be remembered that it gave an error of 5 per cent in the ground state 
energy of helium, and the error increases with the number of electrons. 
Also, convergence difficulties occur in the case of second-order perturbation 
theory, which involves the summing of an infinite series. Treating the whole 
of the Coulomb interaction as a perturbation, therefore, is not very helpful 
in general. 

An approximation to the energy of the ground state may be found by 
means of the variational method, but reasonable accuracy, without excessive 
labour, is only attainable in this way for the lighter atoms, such as helium 
and lithium. Even for helium, trial functions which give close agreement 
with experiment are by no means simple, and, as the number of electrons 
increases, the complexity of the trial function required makes the method 
impracticable. 

Any analytical solution of the problem, even an approximate one of 
reasonable accuracy, would therefore seem to be out of the question for 
the heavier atoms. Consequently, equation (4.3) must be solved by numerical 
methods, with the aid of calculating machines if possible. In other words, 
the attempt to approximate to V by a closed analytical formula is abandoned, 
and by direct numerical computation we attempt to obtain the function 
in tabulated form, that is, as a table of values corresponding to closely 
spaced points covering the whole of configuration space. A difficulty arises 
here, however, for, in the case of a heavy atom, ¥ is a function of a large 
number of variables — the coordinates of all the electrons. It has been 
aptly said that a function of one variable might be adequately tabulated on 
a single page, a function of two variables would take a book, and a function 
of three variables would need a library. Now, a neutral atom of sodium, 
for example, which is not particularly heavy, has 11 electrons, so that 'F 
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is a function of 33 variables, and therefore quite impossible to tabulate 
adequately. 

When dealing with systems of many electrons, then, we are virtually 
compelled to adopt what is known as the one-electron approximation, which 
we shall discuss in the following section. This, by ascribing to each electron 
a separate wave function, reduces the problem to that of tabulating functions 
of three variables, or, with a slight further approximation, functions of a 
single variable — one table for each electron in the atom. 

4 . 2 . The One-Electron Approximation 

We now assume, as would be the case if the Schrodinger equation were 
separable, that each electron can be represented by its own wave function, 
which is a function of the coordinates of that electron only. T his one-electron 
wave function is calculated just as if the problem were that of a single 
electron moving in the field of the nucleus and some additional field which 
represents the average effect of all the other electrons. Thus, the equation 
which the one-electron function ^(i-,) of the zth electron satisfies is taken 
to be of the form 



(4.9) 


where the potential function Vfa) approximately represents the inter¬ 
action of the zth electron with all the other electrons. The initial problem 
is to choose a suitable form for the function V t , but let us defer this for 
the moment, except to remark that we take V t to be spherically symmetrical, 
which is very reasonable for an atom and simplifies the solution of equation 
(4.9) enormously. There are N equations like (4.9), one for each electron, 
and the function V { could be taken to be the same for all of them, in which 
case we could dispense with the subscript i; but this is not necessary, and, 
for a reason which will become apparent, it may be rather a poor approxima¬ 
tion. If V ( were zero, of course, equation (4.9) would reduce to equation 
(4.7), but we have seen that this does not lead to results of great accuracy. 
Notice that in equation (4.9) we have used a Greek letter e to denote the 
eigenvalue — we shall retain this notation for the eigenvalues of one- 
electron equations throughout the remainder of the book. 

Although equation (4.9) is not hydrogenic, since F^r,) will not in general 
be proportional to 1 /r i , the potential energy function still has spherical 
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symmetry, so that the problem is one of a particle in a central field, as 
discussed in § 2.4. Any eigenfunction of equation (4.9) may be expressed, 
in spherical polar coordinates, as a product of three functions 


Hr, 9,cf>) = R(r)0(0)0(<f>), 

and the equation separated into three ordinary differential equations for 
R, 0, and 0. The angular functions 0 and 0 will be exactly the same as 
in the hydrogenic case, but the radial function will not, although it will 
have a similar form. The solutions will depend upon three quantum numbers 
n, l, and m x , where « — /— 1 is equal to the number of nodal spheres, / is 
equal to the number of nodal planes, and m x can have any integral value, 
including zero, from —/to +/. The states are specified as Is, 2s, 2p, 3d, 
and so on, according to the values of n and /. It should be noted that the 
eigenvalue e will in this case depend upon both n and /, not only upon n, 
which is peculiar to the hydrogenic atom. 

Now, having solved the N equations, we may take one eigenfunction of 
each and write an approximate total wave function 0 as their product 


V = tAiO'O'/'aO'z) • • • <A N(r N ), (4.10) 

just as in equation (4.4), and an approximation to the total energy E of 
this particular state is given by 


_ J y*H^dT 
J W*Wdz 


(4.11) 


It is easy to verify that E is not, in general, equal to the sum of the eigen¬ 
values Sj . These eigenvalues do not really correspond to any energies which 
are experimentally measurable. However, we shall see that, with the best 
choice of the function V x , the magnitude of e x is very roughly equal to the 
ionization energy of the electron in the state t/q, that is, the energy required 
to remove this electron completely from the atom. 


4.3. The Pauli Exclusion Principle 

Equation (4.9) has, of course, an infinite number of eigenfunctions, as 
also have the other N— 1 equations. It might at first be thought possible 
to choose any of these eigenfunctions, one from each equation, for in¬ 
sertion in (4.10), and so obtain an approximation to some energy level of 
the system. For atoms with more than two electrons this is found not to be 
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the case — many of the resulting values of E approximate to no energy level 
which can be deduced from spectroscopic data. 

We have previously remarked that the one-electron states may be Is, 
2s, 2p, 3s, 3p, and so on, and the configuration of a state of the atom as a 
whole is just the number of electrons in each type of one-electron state. 
Now, it is generally found that the energy rises along the given series — 
in other words, a 2s state has a higher energy than a Is state, and so on. 
This might lead us to expect that, in the ground state of an atom, all the 
electrons will be in Is states, the states of lowest energy. This is perfectly 
true for hydrogen and helium, with one and two electrons respectively. 
The ground state of hydrogen is certainly a Is state, and the configuration 
is written, simply, (Is). The ground state of helium corresponds, in the 
one-electron approximation, to both electrons being in Is states, and the 
configuration is written (Is) 2 , the superscript denoting two electrons. 
However, when we come to a neutral atom of lithium, which has three 
electrons, the lowest energy level is found to correspond not to all three 
electrons being in Is states, which would be the configuration (Is) 3 , but to 
two electrons being in Is states and one electron in a 2s state, and the 
configuration is written (Is) 2 (2s). In the same way the ground state, or normal , 
configuration of beryllium, with four electrons, is found to be (Is) 2 (2s) 2 , 
and that of boron, with five electrons, is found to be, not (Is) 2 (2s) 3 , but 
(ls) 2 (2s) 2 (2p). Proceeding in this way, comparing the observed ground 
state energies with the energies given by the one-electron approximation, 
a general rule has been deduced, which is as follows: 

not more than two electrons may occupy any orbital state. 

This is known as the Pauli exclusion principle and must be added to our 
list of the fundamental principles of quantum mechanics — it cannot be 
deduced from any of the principles we have introduced previously. It 
should be emphasized, however, that the above form of the principle is 
an approximation, albeit a very good one, because the one-electron states 
to which it is applied exist only as approximations. In the following chapter, 
after introducing a further property of electrons, known as electron spin, 
we shall give a more fundamental statement of the Pauli principle, which 
is independent of the one-electron approximation, but which reduces to 
the above whenever the one-electron approximation is used. We have used 
the name orbital state to distinguish the one-electron states we are discussing 
in this chapter, and which are functions only of the positional coordinates 
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of an electron, from one-electron states in which electron spin is included. 

The normal configuration of an atom is thus obtained by filling the 
orbital states of lowest energy in a manner consistent with the Pauli prin¬ 
ciple. It is important to notice that the degeneracies of the one-electron 
functions must be taken into account. Thus, every p state is triply degenerate, 
corresponding to three possible values —1,0, 1 of the magnetic quantum 
number m lt and the Pauli principle permits two electrons to each value 
of nii, so that we may have six electrons in p states with a given value 
of the principal quantum number n. For example, the normal configuration 
of sodium, which has eleven electrons, is (ls) 2 (2s) 2 (2p) 6 (3s). Similarly, as 
d states are fivefold degenerate, we may have as many as ten electrons in 
d states with the same value of n. 

We do not wish to give the normal configurations of all the atoms, but 
we summarize those already quoted and add one or two others in the 
following list, the total number of electrons being given in brackets after 
the symbol of the element: 


H (1) 
He (2) 
Li (3) 

Be (4) 

B (5) 

Na (11) 
Mg (12) 
K (19) 
Ca (20) 
Ni (28) 
Cu (29) 
Ag (47) 


(Is) 

(Is ) 2 

(Is) 2 (2s) 

(Is) 2 (2s) 2 
(Is) 2 (2s) 2 (2p) 

(Is) 2 (2s) 2 (2p) 6 (3s) 

(Is) 2 (2s) 2 (2p) 6 (3s) 2 

(Is) 2 (2s) 2 (2p) 6 (3s) 2 (3p) 6 (4s) 

(Is) 2 (2s) 2 (2p) 6 (3s) 2 (3p) 6 (4s) 2 
(Is) 2 (2s) 2 (2p) 6 (3s) 2 (3p) 6 (3d) 8 (4s) 2 
(Is) 2 (2s) 2 (2p) 6 (3s) 2 (3p) 6 (3d) 10 (4s) 

(Is) 2 (2s) 2 (2p) 6 (3s) 2 (3p) 6 (3d) 10 (4s) 2 (4p) 6 (4d) 10 (5s). 


It should be noted that the states of higher energy are not always filled 
in the order suggested by the energy levels of hydrogen. In hydrogen, where 
the energy is — 1/n 2 ryd, a 3d state has lower energy than a 4s state, but 
we see that in potassium and calcium, for example, 4s states are filled before 
any 3d state has been filled. Similarly, in nickel, with 28 electrons, two 
electrons are in 4s states, whereas in copper, with 29 electrons, only one 
electron is in a 4s state, the other having gone into a vacant 3d state. These 
anomalies are due to the fact that any electron is ‘screened’ from the full 
effect of the field of the nucleus by the presence of the other electrons. 
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and this screening varies according to the state of the electron, in such a 
way as to cause some electrons in different shells to have very similar energies. 
The order of filling the one-electron states would appear to be roughly as 
follows: 

Is, 2s, 2p, 3s, 3p, 4s, 3d, 4p, 5s, 4d, 5p, 6s, 4f, 5d, 6p, 7s, 5f, 6d. 

Electrons in states with the same value of the principal quantum number 
n are said to occupy the same shell. If a shell contains its maximum number 
of electrons consistent with the Pauli principle it is said to be closed. Shells 
corresponding to n = 1, 2, 3, 4, 5 are denoted by the letters K, L, M, 
N, O respectively, and the number of electrons in a closed shell is In 2 . 
We thus have 

n = 1: K shell ( 2 electrons) 
n = 2: L shell ( 8 electrons) 
n = 3: M shell (18 electrons) 
n = 4: N shell (32 electrons) 
n = 5:0 shell (50 electrons). 

Frequently the electrons in states with the same values of n and l are referred 
to as ‘shells’ — for example, the ten 3d electrons in copper are often called 
the ‘3d shell’. This can be misleading, however, so we shall refer to such 
groups of electrons as sub-shells. The chemical behaviour of an atom depends 
upon those electrons which lie outside closed shells and sub-shells, and 
this can easily be shown to provide an explanation of the Periodic System 
of the elements, but we will omit details. 

The whole of the foregoing discussion has been based upon the solutions 
of equations like (4.9), but we have not so far considered any specific method 
for constructing the functions V { . Several methods have been used, but 
we will at present confine our discussion to the most popular and most 
successful of these, which was first suggested by D. R. Hartree in 1928. 
This method, known as the method of the self-consistent field, not only 
makes a direct appeal to physical intuition, but, as we shall see, can be 
justified by means of the variation principle. 

4.4. The Hartree Method Applied to the Ground State of Helium 

Before describing the Hartree method in general it might be useful to 
consider its application to the normal helium atom, which was the first 
atom to be treated by this method. According to the general procedure 
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described in § 4.2, we write the total wave function in the approximate 
form 

^(r 1 ,r 2 ) = i// 1 0 1 )i// 2 (r 2 ), 


where the one-electron function \j/ 1 satisfies the equation 


h 2 ^ 2 2e 2 


Vi- — d-nCrO 

2 m r i 


<Al = Cl'l'l, 


(4.12) 


(4.13) 


and there is a similar equation for i j/ 2 . In fact, for the ground state, we 
know that the functions \j/ 1 and \j/ 2 will both be the same 
Is function, so that there is really only one equation 
to solve, but let us disregard this for the moment. 

Hartree chose the potential function V^r^ to be r 
the potential energy of the first electron, or electron 1, ■ 
in the field of the average charge distribution or ® 
charge cloud of electron 2. As described in § 2.6, 
we may regard the average charge distribution of 
electron 2 as a charge cloud whose density, at any HartreeTheory of 
point r 2 , is -e\\ ^ 2 (i* 2 )| 2 , and the potential energy the helium atom, the 
of electron 1 due to a charged volume element dr 2 potential energy of 
is e 2 \\J/ 2 (r 2 )\ 2 dr 2 /r i2 . The potential energy of elec- nucleus andthe charge 
tron 1 due to the whole charge cloud is, thus, cloud of electron 2. 





V,(r t ) = e>m 

J r 1 


d r 2 , 


(4.14) 


the integration being throughout all space, 

It may remove a source of confusion if we point out that this definite 
integral is not a function of r 2 , so that any symbol, except r x , could be 
substituted for r 2 . The distance r 12 is defined as li^— r 2 \, so that we could 
equally well write, for example, 


Vi 



feWI 2 

ln-r| 


dr. 


(4.15) 


The Hartree equation for ij/ 1 is, therefore, 


2m rj J 


l</' 2 0 2 )l : 


dr. 


12 


<Ai = ei'Ai, 


(4.16) 


with a similar equation for i// 2 , having the subscripts 1 and 2 interchanged. 
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If we now concentrate on the ground state, a simplification is introduced, 
as previously remarked, because \]/ 1 and t J/ 2 are then the same function 
[and are also spherically symmetrical, which justifies our implied as¬ 
sumption that Vfirf) is spherically symmetrical]. We have, in fact, 


V'Ot ,r 2 ) = tAObjiK^), 


(4.17) 


and the equation for \j/(r) is 



(4.18) 


We have dropped all subscripts here, to emphasize that the equation is 
the same for both electrons, and this necessitates the use of r' , or some 
symbo' other than r, as the variable of integration. This equation is true, of 
course, only if \p is normalized. 

The problem is simply to find the eigenfunction of equation (4.18) which 
has the lowest eigenvalue. Clearly, this cannot be done analytically, and 
cannot even be done directly by numerical integration, because the potential 
energy depends upon the function ip. An equation of this sort is called an 
integro-differential equation, and it is necessary to employ a method of 
successive approximations in order to solve it. We first choose an approxi¬ 
mation to ip — for example, we might choose the function exp (—21 r/ 16fl 0 ), 
which gave reasonably good results in the variational method — and, aftei 
normalizing this chosen function, we use it to evaluate the integral in 
equation (4.18). This equation then becomes an ordinary Schrodinger 
equation, which we can solve by numerical methods, and its solution 
should be a better approximation to \p. This process is repeated again and 
again until two successive repetitions yield results which do not differ 
significantly — this means that, in the final repetition, practically the same 

is used in calculating the potential function as is found by solving the 
equation. The solution is then said to be self-conoistent, and the associated 
field is called the self-consistent field. 

The rapidity of convergence of the process depends, of course, upon 
the closeness of our original approximation to t f/. Also, it must be remem¬ 
bered that equation (4.18) has an infinite number of eigenfunctions, so 
that at each stage we must be careful to choose the solution which in¬ 
terests us — in this case, for the ground state, we require a Is type of solution, 
that is, a spherically symmetrical function without nodes. 





Ch. 4, § 4] 


THE HARTREE METHOD 


101 


Since is normalized, the total wave function must also be, for 

Jj '?*f'dr 1 dr 2 = JI^WPdr.J \<P(r 2 )\ 2 ir 2 = 1. (4.19) 

Equation (4.11) therefore gives, for the approximate energy of the ground 
state, 


E = 


I 


d t 


= ^r(ri)r(r 2 ) 

= 2e— f f — l*A(^ 2 )| 2 d»* 1 d»' 2 , 

J J r 12 


- — (V? + Vf)- — - — + — 
2m r 2 r 2 r 12 


J^(ri)^(r 2 )dfi 


dr. 


(4.20) 


using equation (4.18). 

The eigenvalue e has already been found during the calculation of i j/, 
and the integral, which represents the mean interaction energy of the 
electrons, can be evaluated by the method given in Appendix 4 — numerical 
integrations are necessary, of course, since i j/ is obtained as a tabulated 
function. In this way E is found to be —5.73 ryd, compared with the ex¬ 
perimental value of — 5.81 ryd. The error of 1.4 per cent is only two thirds 
of that obtained by the variational method given in § 3.3. The latter, however, 
employed a trial function which was the product of two simple exponential 
functions — by the use of more complicated trial functions the variational 
method can, as remarked before, give results of much greater accuracy. 
We shall show below that the Hartree wave function is, from the point of 
view of the variation principle, the best wave function of the simple product 
form (4.12). 

The error is due essentially to the fact that the electrons do not move 
independently, which is neglected in the Hartree method. Owing to their 
mutual repulsion, the electrons tend to avoid each other, and their motions 
are said to be correlated. The Hartree method, although it does not com¬ 
pletely ignore the interaction between the electrons, does not account for 
this correlated motion. The energy given by the Hartree method is therefore 
too high, for it assumes that the electrons spend more time in the im¬ 
mediate vicinity of each other than is actually the case. We shall see later 
in the book that this question is of great importance in the theory of metals. 
Of course, in the variational method, the correlation between the motions 
of the electrons can be taken into account by the inclusion of terms involving 
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r 12 , the distance between the electrons, in the trial function, but the variational 
method is not a practical proposition for systems of large numbers of elec¬ 
trons. 


4.5. The Hartree Method in General 

The generalization of the foregoing procedure to the case of N electrons 
is so obvious that we need do little more than write down the results. Each 
electron is supposed to move in the field of the nucleus and that due to 
the charge cloud of all the other electrons. The potential energy of electron 
i, due to the latter, is just the sum of expressions like (4.14), one for each 
electron except electron i itself — in other words, it is 



Since we are at the moment dealing with atoms, we have again written 
this as a spherically symmetrical function — a function of r,- rather than 
of r t . This is always so if the i j/j are s states, and it can be shown that the 
total charge cloud of the electrons in closed shells and sub-shells is spherically 
symmetrical, so that their contribution to V i must also be. An incomplete 
p or d sub-shell might, however, have a charge cloud which is not spherically 
symmetrical — in this case, for purposes of computation, an average is 
taken over the angular coordinates so as to obtain a spherically symmetrical 
contribution to V t . This is an additional approximation, but gives rise to 
an error which is negligible compared with those already involved. We wish 
to make it quite clear, however, that this spherical symmetrization is only 
justifiable for atoms. The Hartree method can, of course, in principle be 
applied to molecules and to solids — we shall, in fact, consider it in relation 
to metals later in the book — and in these cases, although the Hartree 
potential function is still that given in equation (4.21), it is no longer even 
approximately spherically symmetrical, since there are several centres of 
force present. 

The Hartree equations for the system are obtained by substituting (4.21) 
in equation (4.9): 



M r i) = etH't), 


i = 1,2, ... N. (4.22) 
We are thus faced with the problem of solving N integro-diflferential equa- 
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tions simultaneously, since the potential function in each depends upon the 
solutions of all the others. This would appear to be, and indeed is, a for¬ 
midable task, but it is by no means an impossible one for atoms, and the 
method has been applied successfully to a large number of them. The 
procedure is similar to that given for helium. Bearing in mind the con¬ 
figuration of the state whose energy we are seeking, we choose a set of ap¬ 
proximate i pi and with them calculate the TV potential functions. The Hartree 
equations then become ordinary Schrodinger equations and we may solve 
them all by numerical methods, choosing in every case the particular 
eigenfunction we require, that is, Is, 2s, 2p, and so on. With these solutions 
we re-calculate the potential functions and repeat the whole cycle of opera¬ 
tions. We go on repeating until self-consistency is attained, that is, until 
the last set of solutions differs negligibly from the previous set. 

The approximate total wave function is now taken to be 

V = </'iOi)«/' 2 0‘2) • • • * Mr N ), (4.23) 

and this is normalized, since the one-electron functions are — it must be 
remembered that the solutions have to be normalized at every stage of the 
above calculation. The approximate total energy is, thus, 


= J rj/f dt = J... J *♦(„) ■ ■ • «(<•*) [- 1 (4- vf + 5L) 


1 * " 

+ ill — 

2 1*J r {j . 


i( r i) • • • • • • dr A 


- -UM 


2 m r t 


ipi(ri)dri 


+ ! II f f ~ M r i)'l'/rj) dr i dr j' 

2 i*j JJ r u 


(4.24) 


all the other factors being unity owing to the normalization of the \p ; . 
Now, using equations (4.22), we find 

E = I - £ I f f- Wr l )| 2 I^Xr,)| J dr,dr y . (4.25) 

i= 1 2 i*S JJ Tij 

The total energy is not just the sum of the one-electron eigenvalues, 
therefore, but this minus the average interaction energy of the electrons. 
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4.6. The Hartree Method and the Variation Principle 

Suppose we wish to apply the variational method to our iV-electron 
atom, using a trial function of the product form (4.23). We now make no 
assumption about the l J/ t , except that they are normalized, the problem 
being to choose these one-electron functions so as to obtain the best 
approximation to the ground state energy of the system. In other words, 
the problem is to choose the i \J/ t so as to minimize the quantity E given by 
equation (4.24). 

Now the only terms on the right-hand side of equation (4.24) which 
contain the particular function i/q are 


-JV,*(«■,) Zc 


h_ 

\lm 


Vf + 


-) </'.(»'«) d »*i 

i ' 


N 


+ 


and this expression may be written 


where 


h 2 ■, 7e 2 N r 

2m r t j*iJ 



(4.26) 

5 

(4.27) 

d 

r { ; 

(4.28) 


If the one-electron functions are to be chosen to minimize E, then i/q 
must be chosen so as to minimize (4.27), for no other terms contain i/q. 
The variation principle tells us, however, that the function i/q which mini¬ 
mizes expression (4.27) is the eigenfunction, corresponding to the lowest 
eigenvalue e t , of the equation 


or 



(4.29) 


Comparing this equation with (4.22) we see that it is, in fact, the Hartree 
equation for electron i. 

The variational method with a trial function in the form of a single 
product therefore gives the same one-electron functions as are obtained by 
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the method of the self-consistent field. We may conclude that the Hartree 
functions are the best one-electron functions to be used in a single product 
form of wave function for the ground state. Admittedly, the application of 
the Pauli principle causes a slight complication, for it does not, except 
in the case of helium, permit all the one-electron functions to be the eigen¬ 
functions of lowest energy of the corresponding Hartree equations, but a 
full discussion of this lies outside the scope of the present work. 

Now, if we remove electron i from the atom, that is, if we remove i// ; 
from IP, then the terms (4.26) are removed from E. Consequently, if we as¬ 
sume that all the other one-electron functions are unaltered by this opera¬ 
tion, we have merely reduced E by the amount e ( -, this being the value of 
(4.26) or (4.27), as shown by (4.29). In other words, — e t is approximately 
the energy required to remove electron i from the atom. This is not quite 
accurate, in fact, because the removal of any electron alters the self-con¬ 
sistent fields and wave functions of the others. None the less, it is reasonably 
accurate, particularly for the inner shell electrons — those electrons whose 
charge clouds are concentrated close to the nucleus. 

Although the Hartree method gives the best ground state wave function 
of the form (4.23), it is not the most accurate method of computing the 
energy, simply because this single product of one-electron functions is not 
the best approximation to the total wave function. We shall consider im¬ 
provements to the Hartree method in later chapters. 


CHAPTER 5 


ELECTRON SPIN AND THE PAULI PRINCIPLE 


5.1. Orbital Angular Momentum and Magnetic Moment 


Let us first consider the classical picture of an electron moving in a 
small circular orbit of radius r, with speed v. 

For convenience we will assume that the orbit 
is in the plane z = 0 and is centred at the origin 
O, and also that the motion is clockwise when 
sighting along the positive z-axis. 

The components of the angular momentum of 
the electron about O are thus 

L x = L y = 0, L z = mrv. (5.1) 

Now the motion of the electron is equivalent 
to an electric current / flowing around the 
orbit, and I is equal to the charge — e divided 
by the time taken to make one revolution, or 



y 


Fig. 5.1. Electron moving 
in a circle of radius r, 
centred at O, in the plane 
z = 0. 




(5.2) 


in the same direction as that of the electron. 

Also, it is known that a current I in a small loop of area A is equivalent 
to a magnetic dipole of magnetic moment IA/c, in Gaussian units, per¬ 
pendicular to the plane of the loop, in a sense given by the right-handed 
screw rule with respect to the sense of the current. In our present case, then, 
the motion of the electron gives rise to an orbital magnetic moment whose 
z-component is 


_ evr e 

“ 77 “ ~ 277 ” 


(5.3) 


the other two components being zero. 

A magnetic dipole of moment H has a potential energy —fi-H in a mag¬ 
netic field ##. If we assume, in our example, that there is a field H in the 
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positive z-direction, the electron will have a potential energy 

~fi z H = —L z H 


2 m c 


(5.4) 


due to the interaction of its orbital magnetic moment with the field. This 
result remains true whatever the shape and orientation of the orbit. In 
fact, if the electron is moving in a central electrostatic field, its orbit will 
not in general remain in the same plane when a magnetic field is switched 
on, but will precess around the direction of the magnetic field, and the 
energy of this precessional motion is given by (5.4). 

What is more important for our purposes is that the result (5.4) remains 
true in wave mechanics, except that now L z is quantized, that is, it may 
only assume a discrete set of values. We have seen in § 2.8.2 that, for a 
particle in a central field of force, L z = m t h and can have 2/+1 values 
corresponding to the possible values -/, -/+1,... / of m u the magnetic 
quantum number. Consequently, the z-component of the magnetic moment 
can have any one of the 2/41 values given by 


and in the presence of a magnetic field H in the positive z-direction the 
(2/4-1 )-fold degenerate level with azimuthal quantum number / will split 
into 2/4 1 separate levels spaced at intervals of ehH/2mc. This gives rise 
to what is known as the normal Zeeman effect in atomic spectra, whereby 
some spectral lines are split into three by a magnetic field. The reason 
why only three lines appear, rather than in general 2/41, is due to the 
operation of certain selection rules, which we shall not prove, but which 
state that in transitions from one state to another, due to the absorption 
or emission of light, m x may change only by O.or 41. 

We do not wish to go into full details of the wave-mechanical treatment 
of atoms in magnetic fields, but the foregoing discussion can be made more 
plausible in a very simple way. The Schrodinger equation for an electron 
in a central field is 



(5.6) 


2m 


say, and we have seen in § 2.4, equation (2.67), that the eigenfunctions 
may be written 



(5.7) 
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those with m t — —l, —l+l,... I being degenerate. Now, if a magnetic 
field H is switched on, in the positive z-direction, we expect the potential 
energy to be increased by the amount given in (5.4). Representing L z by 
the quantum-mechanical operator 


L z = 


hd_ 
i 50’ 


(5.8) 


as given in equation (2.139), the Schrodinger equation for the system in the 
presence of the magnetic field becomes 


- —V 2 +V(r) + 
2m 


ehH 5~ 
2mci 50_ 


0 = £ 0 . 


( 5 . 9 ) 


Although the derivation has not been rigorous, this equation is correct, 
in fact, provided the field H is not too large. Since 0 only occurs in the 
factor e im '^ in the expression (5.7), we see that 



n, l, mi 




(5.10) 


so that 0 n ,; )mi is also an eigenfunction of equation (5.9). Our choice of 
eigenfunctions was therefore very fortunate, for had we chosen linear com¬ 
binations of the functions (5.7) with given n and /, which would have been 
equally good eigenfunctions of equation (5.6), we should have found 
that they were not eigenfunctions of (5.9). From (5.6), (5.9) and (5.10), 
we find 


£ 


£° + 


ehH 
2m c 


( 5 . 11 ) 


so that the eigenfunctions 0„ >/>m , of (5.9) are not degenerate, but each value 
of m t corresponds to a different energy, and there are 21 +1 separate levels, 
at intervals of ehH/2mc, with given n and /. It should be noted that, if the 
electron is in an s state, there should be no splitting of the energy level 
in a magnetic field, due to the orbital magnetic moment, for / and m t are 
zero. 

In order to apply these ideas to many-electron atoms, we make use of 
the one-electron approximation described in the previous chapter. For a 
closed shell or sub-shell of electrons the z-components of the total orbital 
angular momentum and magnetic moment are clearly zero, for to every 
electron with a given positive value of m l there is an electron with an 
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equal negative value of m x . A magnetic field should therefore have no effect 
upon the closed shells and sub-shells of an atom. An atom such as that 
of silver, which consists of closed shells and sub-shells, apart from one 
electron which is in a 5s state, and so without orbital magnetic moment, 
should therefore be unaffected by a magnetic field. The fact that this is 
not so is one of the reasons for introducing the concept of electron spin, 
which we shall discuss in the following section. 

Beginners are often worried by the fact that the z-axis seems to have 
arbitrarily been singled out as the axis along which the angular momentum 
is quantized. It should be realized that this is not arbitrary, but is due to 
the fact that the magnetic field has been taken to be in the z-direction. 
The wave functions (5.7) are eigenfunctions of L z , and they are only eigen¬ 
functions of equation (5.9) also because the magnetic field is in the z- 
direction. If we had taken the magnetic field to be in the x-direction, say, 
the eigenfunctions of the corresponding Schrodinger equation would have 
been found to be simultaneously eigenfunctions of L x , not L z , and the 
values of L x would have been quantized. The splitting of the energy levels, 
however, would have been unaltered. 


5.2. Electron Spin 

The theory we have presented so far fails to account for several known 
experimental facts. In the first place, where the theory would predict a single 
line in the spectrum of an atom, a group of closely-spaced lines is often 
observed — this is known as the fine structure of spectra. A well-known 
example is that of the so-called sodium D-lines — two prominent fines 
occur in the spectrum of sodium, corresponding to the wavelengths 5890 A 
and 5896 A, where only one should be observed. Such a pair of fines is 
called a doublet, a group of three fines is called a triplet, and in general a 
group of several fines is called a multiplet. 

Again, in the presence of a magnetic field, where the normal Zeeman 
effect would give rise to a triplet, generally a larger number of fines is ob¬ 
served. This is known as the anomalous Zeeman effect. 

Another striking example is provided by the results of the experiments 
of Stern and Gerlach on the splitting of beams of atoms in an inhomogeneous 
magnetic field. It is known from classical electromagnetic theory that an 
atom with magnetic moment p in a magnetic field H will experience a force 
(p ■ V)#/, or 
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the derivatives being evaluated at the position of the atom. In particular, 
if the field is in the z-direction, and varying with z only, as we will assume, 
the total force will be p z dH/dz in the z-direction. If the field is homogeneous, 
or uniform, the force is zero, since dH/dz is zero — this is quite obvious 
if we picture the atom as an elementary magnet, for in a uniform magnetic 
field the forces on the two poles would be equal and opposite. A beam of 
atoms would therefore not be deflected on passing through a uniform 
magnetic field. From the discussion of the previous section we should, 
however, expect a beam of atoms generally to be split into several beams 
on passing through an inhomogeneous magnetic field. If, for example, they 
are hydrogenic atoms, all in the same state with azimuthal quantum number 
/, we should expect the beam to be split into 21+ 1 beams, corresponding 
to the 2/+1 values of p z . The same splitting is expected if the atoms have 
several closed shells and sub-shells of electrons and one additional electron 
in a state with azimuthal quantum number /. It is clear, however, that if 
this single electron is in an s state no splitting is expected, since / = 0. An 
example of this is the normal silver atom, which has 46 electrons in closed 
shells and sub-shells and one additional electron in a 5s state. None the 
less, Stem and Gerlach found that a beam of silver atoms was split by 
an inhomogeneous magnetic field into two separate beams. The same effect 
has been found with beams of other ‘one-electron’ atoms, including those 
of hydrogen, lithium, and sodium. 

An explanation of all these facts was provided by the suggestion of 
Goudsmit and Uhlenbeck, in 1925, that an electron possesses an intrinsic 
angular momentum or spin momentum S and corresponding magnetic 
moment M. In other words, an electron is in some sense to be regarded 
as a charged particle spinning about an axis — although we shall see that 
this picture must not be taken too seriously. We assume that, in the presence 
of a magnetic field in the z-direction, S z is quantized in the same way as 
L z , and the quantized values may be deduced from the results of Stern 
and Gerlach. It follows from our previous discussion that if, as a result of 
the orbital angular momentum, a beam were split into 2/+1 beams, we 
should deduce that the quantized values of L z were - Ih , (-/+ l)h ,. . . Ih. 
In the same way, since a beam of ‘one-electron’ atoms is split into two 
beams as a result of the spin momentum (and 2/+1 =2 gives / = £), 
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we may deduce that the quantized values of S z are -\h and \h. Also, the 
amount of the separation of the beams shows that the corresponding values 
of M, must be eh/2mc and -eh/2mc. Thus, the ratio of the z-components 
of spin magnetic moment and spin angular momentum is 


M 2 

S 2 


e 

me 


(5.12) 


We see, however, from equation (5.3), that the ratio of the z-components 
of orbital magnetic moment and orbital angular momentum is 


/A _ _ ^ 

L. 2 me 


(5.13) 


The fact that these two ratios are different indicates that the picture of 
an electron as a spinning charged particle cannot be a true one, for this 
would lead to the latter ratio — none the less, it is a useful intuitive guide. 

The quantity eh/2mc is called the Bohr magneton and is often denoted 
by l 1 b- Thus, in the presence of a magnetic field H the energy of an electron 
will be changed, as a result of its spin, by + /u B H, corresponding to the values 
+ 2 ft Of iS-. 


5.3. Spin Wave Functions 

Electron spin appears quite naturally in the relativistic quantum 
mechanics of Dirac, and the correct value of the magnetic moment is ob¬ 
tained. Again, without invoking relativity, electron spin can be grafted 
on to the previously developed theory by means of the Pauli spin matrices. 
However, in all the problems we shall meet, electron spin does not appear 
in the Hamiltonian — at least, as a very good approximation — and this 
permits us to dispense with both of these elaborate theories. 

It will be sufficient for our purposes if we recognize that for each one- 
electron state that we have considered so far we now have two states, in 
one of which the electron has, say, positive spin, and, in the other, negative 
spin — the former corresponding to S z = and the latter to S z = — |ft. 
To incorporate the spin into our mathematical formalism we simply have 
to introduce a spin quantum number m s and a spin variable (, so that a com¬ 
plete designation of a one-electron state becomes 

( Pn.l,m I .m s (x i , yh, 0 - 

In future we shall always use (j) to denote a function of both the positional 
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and spin coordinates of an electron — there should be no confusion with 
the angular coordinate 0 — and shall retain i ]/ to denote an orbital function, 
that is, a function of the positional coordinates only. 

By analogy with the magnetic quantum number m t , we define m s so 
that S z = m s h, which means that the possible values of m s are +■£. Also, 
it is found that the success of the theory demands that the spin variable 
( be permitted to assume only two values, which we will arbitrarily take to 
be +1. This seems a little artificial, and indeed it is, but it is necessary if 
we are to obtain the desired results without completely changing our 
mathematical treatment. We have seen that it is not strictly possible to 
regard an electron as a spinning charged particle, and it would be meaningless 
to ascribe an angle of rotation to it. There is, in fact, no satisfactory physical 
picture of the spin variable (, and it is not necessary to have one, but if 
it is found helpful £ may be regarded as the cosine of the angle which the 
axis of spin makes with the z-axis — this angle can only have the values 0 
and n, however. 

Corresponding to each one-electron state we have considered previously, 
then, we now have two: 

^*n, I, mi, y> C) and y* z, £). 

We know that an electron which is not in an s state has a magnetic moment 
due to its orbital motion as well as its spin magnetic moment, and there 
must be an interaction between the two — we may picture the electron as 
an elementary magnet moving in the magnetic field created by its orbital 
motion. This is called spin-orbit coupling, and it means that the orbital 
motion of the electron, and its energy, depend upon the direction of its 
spin, even in the absence of an external magnetic field. In fact, the spin 
magnetic moment of the electron is forced to be in the same direction as 
the orbital magnetic moment, or in the opposite direction. This gives rise 
to the fine structure of spectra mentioned earlier. However, it may be 
shown, and this is borne out by the observed fine structure splitting, that 
the energy due to this effect is very small — it is smaller than the energy 
required to remove the electron from the atom by a factor of the order of 
(e 2 /hc) 2 , and e 2 /fic has the value 1/137. As far as the calculation of the 
energy of an atom is concerned, therefore, the spin-orbit coupling can 
generally be neglected. Then each of the above wave functions may be 
written as the product of an orbital function and a spin function, the orbital 
functions being the same: 
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y, z)Xi(0 and il/ n>hmi (x, y, z)x~ i (£). 

It is customary to denote the function by a and by /?, so that we have 

fin,I, mi ,$■(*» T* C) = ’ 0 a (0> 

<t>n.l,m h -t(x, y, Z, 0 = ' K,l.m,{x, T, z)0(C)- 

The meaning of the spin states is as follows: if an electron is in the positive 
spin state a, then S', is certain to have the value /i/2, while, if it is in the 
negative spin state ft, S z is certain to have the value — /i/2. In other words, 
when a magnetic field H is switched on in the z-direction, the energy of an 
electron in the state a will be increased by /r B H, and the energy of an electron 
in the state /? will be increased by — H. It will be seen below that this 
meaning is completely expressed, in a manner consistent with our previous 
interpretation of wave functions, if we permit the spin functions to have 
only the two values 0 and 1: thus, 

a(l) = l, ot(— 1) = 0, 

0(1) = 0, 0(— 1) = 1- 

The spin coordinate £ can only have the two values ± 1, so that the spin 
‘space’ consists of only two points. An integration of a function of £ over 
such a space would give a zero result. It is therefore necessary to replace 
an integration with respect to £ by a summation over the two values of £. 
For example, 

|a(C)| 2 dC = I KOI 2 = Kl)| 2 + K-l)| 2 = 1. (5-14) 

J {=±i 

It follows that the functions a and /? as defined above are normalized. 
They are also orthogonal, for the functions are real and 

f oc(£)P(Qd£ = £ oc( 00(C) = a(l)0(l)+«(-l)0(-l) = 0. (5.15) 

J £= ± 1 

We must also interpret KOI 2 as the probability that, in the state a, 
the spin coordinate has the value £. This is entirely consistent with the 
meaning we have given to the spin states, for 

|a(l)| 2 = 1 and |a(-1)| 2 = 0, 

which means that, in the state a, £ is certain to have the value 1 — in other 
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words, the spin momentum is certain to be in the z-direction, so that 

S z = h/2. 

Finally, we wish to introduce a new notation which will prove very 
convenient. In the previous work we have written an orbital function 
i j/(x, y, z ) concisely as ^(r). We shall in future let the vector x stand for 
the four coordinates (x, y, z, (), so that a one-electron wave function 
including spin may be written 4>(x ) instead of (j)(x , y, z, () or (p(r , Q. Such a 
function of the positional and spin coordinates, incidentally, is often referred 
to as a spin-orbital. Similarly, we shall let dx denote a volume element of 
configuration space including spin, so that 

d* = drd£. 

In this convention, then, if f(x) is any function of the positional and spin 
coordinates, we have 

jf(x)dx =jf(r, 0 drd( = C)dr. (5.16) 

As an example, we will consider the normalization of a spin orbital 
4>(x). Suppose that 

4 >( x ) = 

Then the normalizing condition 

J 4>*(x)(f>(x)dx = 1 

mean^ simply, 

Z f I Wl 2 KOI 2 dr = 1. 

± 1 ^ 

Putting in the two values of a, we see that this equation reduces to 

J |i/d»| 2 d r = 1. 

Hence, if the orbital function is normalized, so is the spin-orbital — which 
could have been deduced immediately from the fact that the spin functions 
are normalized. 

This completes the formal theory of electron spin as far as we shall 
require it in the remainder of the book. 
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5.4. The Pauli Principle and the Symmetry of Wave Functions 

In the case of a many-electron atom it may be deduced from the ex¬ 
perimental data that, if two electrons are in the same orbital state, they 
must have opposite spins. For example, an atom of helium in its ground 
state is found to have no magnetic moment, so that the two electrons, which 
are both in Is orbitals, must have their spin magnetic moments opposed. 

If the spin is included in the specification of a one-electron state, there¬ 
fore, the Pauli principle may be expressed as follows: 

not more than one electron may occupy any state. 

This is clearly equivalent to our previous statement in terms of orbital 
states only. It is still incomplete, however, in that it refers to the one- 
electron approximation. Before giving a more general statement of the 
principle we must consider the symmetry of many-electron wave functions 
with respect to the electronic coordinates. 

We first note that electrons are indistinguishable. In experiments on many- 
electron systems it is impossible to label the electrons. Thus, we may observe 
that there is a certain number of electrons in a given set of states, but we 
can never know which electron is in any particular state. This means that 
an observable quantity which depends upon the electronic coordinates 
must be symmetric with respect to those coordinates — in other words, 
any interchange of the coordinates of the electrons, including the spin 
coordinates, must leave the observable quantity unchanged. 

Now the wave function W is not itself a physically observable quantity, 
but |!F| 2 is. Thus, any interchange of the coordinates of two electrons must 
leave the value of | f'| 2 unaltered. For example, interchanging the coordinates 
Xj and x 2 , which include the spin coordinates, we must have 

I'J'Oi, x 2 , . . • *„)| 2 = \V(x 2 , x*| 2 , 

for an iV-electron system. In general, if the operator P represents any one 
of the Nl different permutations of the electronic coordinates, then 

in*i, * 2 , • • • **)i 2 = \pn*i,x 2 , ... %)i 2 . (5.i7) 

Now let us examine the implications of this for the wave function T. 
If the operator P 12 represents the interchange of x 1 and x 2 , it follows from 
equation (5.17) that 

W(x 2 , x lt ...x N ) = Pi 2 ^(xi, * 2 >. . . x N ) 

= e'” 1 ¥(x„ x 2 ,. . x„). 


(5.18) 
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where y l2 is a real constant to be determined. Equation (5.18) is seen to 
correspond with equation (5.17) if we take the moduli of both sides, for 
|e' m | = 1. If we apply the operator P 12 again to equation (5.18), we find 



P 12^(* 2 , *1. 

• • • x N ) 

= W(x 

15 * 2 j • • • *jv) 



= P 12 P 12^(*1> *2, 

• • • x N ) 

= e 2iy 

12 ^(x l9 X 2 ,... x N ). 

(5.19) 

Thus, 







e 2, ’ m = 

1, or 

e im 

= +lf 


so that 







•F(x 2 , Xi, . . . 

x N ) = 

±n*i 

, X 2 5 • • • ^jv)j 

(5.20) 


and the same result holds true for the interchange of the coordinates of 
any pair of electrons. In other words, the function IF must be either sym¬ 
metric (if it does not change sign) or antisymmetric (if it changes sign) 
with respect to the interchange of the coordinates of any pair of electrons. 

It can also easily be seen that *F cannot be antisymmetric with respect 
to one pair of electrons and symmetric with respect to another pair, for 
this leads to inconsistency. Thus, suppose *F were antisymmetric with 
respect to x l and x 2 , and symmetric with respect to x 2 and x 3 ; we should 
then have 

*T(*l5 X 2 , X 3 ) = — lF(x 2 , Xi, x 3 ) = — !F(x 3 , Xi, x 2 ) 

— x 2 , Xj) = 1F(x 2 , X 3 , X t ) = — lF(x l5 X 3 , X 2 ) 

= - 5 P (* l5 x 2 , x 3 ). ( 5 . 21 ) 

It therefore follows from the indistinguishability of electrons that IF 
must be either symmetric in all the coordinates or antisymmetric in all 
of them, but, in order to decide which, we must appeal to experiment. In 
fact, what may be regarded as the generalization of the Pauli principle 
simply states that 

for a system of electrons, the total wave function must be antisymmetric 
in the. electronic coordinates, including spin. 

We shall show in the following section that when this antisymmetry 
principle is applied to the one-electron approximation it leads to the Pauli 
principle in the form given previously, whereas this would not be so if a 
symmetric total wave function were assumed. It is found that systems of 
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protons and neutrons also must have anti symmetric wave functions, while 
a system of photons, for example, must have a symmetric wave function. 

5.5. Determinantal Wave Functions 

Let us suppose that a set of one-electron wave functions of a system 
of N electrons has been obtained, by the Hartree method for instance, and 
that these are, with the appropriate spin functions included, 

<£i(*i)> <f> 2 O 2 ), • • • 4>n(*n), 

where, for example, 

The only approximate total wave function we have so far considered in 
detail has been of the simple product type: 

0 1 (-'‘I) (Piixi) • • • 

Clearly any such product, with the electronic coordinates rearranged in 
any order, would be equally acceptable and would give the same energy. 
None of these single product functions obeys the antisymmetry rule, 
however. 

A more general function may be constructed by taking a linear com¬ 
bination of all such products; thus 

X a p^ >< i ) i (*i)^2(*2) • • • (5.22) 

p 

where the a P are constants and the sum is over the TV! permutations P of 
the electronic coordinates. This function is antisymmetric, as required, 
if all the a P have the same absolute magnitude and positive or negative 
sign according as P is an even or an odd permutation, that is, according 
as P corresponds to an even or an odd number of interchanges of pairs 
of electrons. So long as is not required to be normalized we may take 
the magnitude of a P to be unity, and write concisely 

¥ = I (-1^P0x(*i)0 2 (*2) • • • <£*(**)> ( 5 - 23 ) 

p 

where p is the number of interchanges in the permutation P. For example, 
p = 2 for the permutation which changes (123) into (312), for this per¬ 
mutation is equivalent to interchanging 1 and 2, and then 2 and 3 — or 
1 and 3, and then 1 and 2. 
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The wave function (5.23) may be written in the determinantal form (see 
Appendix 3) 


<M*i) ^i(* 2 ) • • • 4 > i ( xn ) 

4> 2 (x l) 4> 2 O 2 ) ■ ■ • 4>2{*n) 


(5.24) 


V = 


(f> N (x i) ^ N (x 2 ) • • • 4>nM 


This makes it clear that W is antisymmetric in the electronic coordinates, 
for if we interchange any pair, say JCj and x 2 , we interchange two columns 
of the determinant and hence change its sign. It can be proved, in fact, 
that this determinantal function is the only antisymmetric function of the 
form (5.22), but the proof requires group theory and we will omit it. 

The Pauli principle follows immediately, for, if two of the one-electron 
functions <j> t are the same, then two rows of the determinant are the same 
and its value is identically zero. This is not permissible for a wave function, 
and so we have the fact that no two electrons may be in the same one- 
electron state. It is easily seen that a symmetric function of the form (5.22), 
which is obtained by letting all the ap have the same magnitude and the 
same sign, does not lead to the Pauli principle, for if two of the one-electron 
functions are the same in this case the total wave function does not vanish 
— indeed, for those systems of particles which are described by symmetric 
wave functions, all the particles may be in the same one-particle state. 

5.6. The Expectation Value of a Symmetric Operator and the Normaliza¬ 
tion of a Determinantal Wave Function 

The wave function W given in equation (5.24) is not normalized even 
when the one-electron functions <t> t are normalized. The normalizing 
constant depends upon the fa, in general, but can easily be evaluated if 
these functions are orthogonal as well as normalized, that is, if 



In this case, as we shall prove below, the normalizing constant is 1/ViV!, 
so that the normalized determinantal wave function is 



(5,26) 
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Before proving this it will be useful to present a simple theorem relating 
to the expectation value of a symmetric operator F, which is defined as 

< j p>=JwPdT', (5.27) 

the prime on the volume element indicating that a summation over the 
values of the spin variables is included in the integration throughout the 
configuration space of the system; thus, 

dr' = dx x dx 2 . . . dx A . 

A symmetric operator is one which is symmetric in the electronic coor¬ 
dinates, so that PF = F, where P is a permutation operator as previously 
defined — the Hamiltonian is an example of a symmetric operator. The 
theorem is as follows: 

for a symmetric operator F and normalized determinantal wave function 
F, we may write 

J <F*F<Pdt' = VW! J • • • M*h) dt'- (5.28) 

In other words, in the expression for < F} we may replace the determinant 
F by ViV! times the product of its diagonal elements. It is assumed, of 
course, that the are orthonormal, as in (5.25). The proof is straight¬ 
forward. We have, using (5.26), 


I 


F*FF dV = 


—L f F*F £ (- 1) p P<M*i)0 2 (*2) • • • </>nOn) dr' 

ViV ! J p 

= 4= I (- ] ) pp f (P-'v'WxMMtz) ■ ■ ■ M*nW 

\JN\ p J 

= -i= I P f F*F<l> 1 (x 1 ) (f) 2 (x 2 ). . . 4> n (x n ) dr' 

ViV! p J 


= ViV! F*F4> 1 (x 1 )(I) 2 (x 2 ) . . . <t> N (x N ) dt'. 


(5.29) 


In the second line of this equation we have transferred the summation over P, 
and the operator P, to the left of the integral sign, and this necessitates 
our annulling the effect of P upon F* by introducing the operator P -1 
(which is to operate upon F* only). The fact that P now operates upon F 
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is of no consequence, since P F = F, which is the reason for specifying a 
symmetric operator. Now the number of interchanges in P -1 is just p, 
the same as in P, since the two operators affect the same interchanges in 
reverse order. Hence 

= ) P W*, (5.30) 

for P -1 merely produces p interchanges of pairs of columns of the deter¬ 
minant. Since there is a factor (—l) p already present, and (— l) 2p = 1, 
we obtain line three of equation (5.29). Finally, we note that the value of 
a definite integral is independent of the symbols used for the variables of 
integration. Hence, the integral in line three has the same value for every 
permutation P, and there are iV! permutations in all. So we arrive at the 
required expression. 

We shall find this theorem particularly useful when calculating the 
energy of a metal, in the following chapter. Meanwhile, we may use it to 
show very easily that the function IP given in (5.26) is normalized. The 
operator F is in this case unity, and is symmetric in the electronic coor¬ 
dinates, for it does not depend upon any of them. Thus, 

J= VniJ wtiMUxJ ■ ■ ■ 

= J[E(-WfOO- • • 

=J I‘M*i)I 2 I<# , 2 (* 2 )I 2 • • • IfcONI 2 ^' 

= Jl'#’i(*i)| 2 dx 1 . . .J|<M>„)| 2 d*,\' = 1. (5.31) 

all the other terms of •F* giving zero contribution owing to the orthogonality 
of the (pi. 


5.7. The Ground State and the Excited States of the Helium Atom 


As a simple example of the inclusion of electron spin and the use of a 
determinantal wave function, let us consider again the application of the 
one-electron approximation to the helium atom. It will be recalled that 
the Hamiltonian of this system is 


2m 


2e 2 


2e 2 e 2 

-1-> 

r 2 I’12 


(5.32) 
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and, if the electronic interaction term e 2 jr l2 is neglected, the Schrodinger 
equation separates into two hydrogenic equations of the form 

For simplicity, in the following work, we will choose our one-electron 
orbitals to be eigenfunctions of this equation, and treat the term e 2 /r l2 
by first-order perturbation theory. Our procedure will therefore be similar 
to that of § 3.6.1, except that we will now consider excited states also and 
will use properly antisymmetrized wave functions. Although we have seen 
that first-order perturbation theory gives a reasonable result for the 
ground state, we have also seen that the Hartree method gives a better 
one, so that it would be more reasonable to use the Hartree orbitals in 
constructing our determinantal wave functions. Indeed, the general procedure 
is the same whatever type of orbitals is used, but the advantage of using the 
eigenfunctions of equation (5.33) is that they are orthogonal, whereas the 
Hartree functions are not, and this orthogonality makes it easier to nor¬ 
malize the total wave function. We are more concerned with demonstrating 
the general effects of using a determinantal wave function than with obtaining 
an accurate energy value. 

(i) Ground state. In the ground state we know that both electrons occupy 
the same Is orbital, whose normalized wave function i J/(r) we are assuming 
to be the eigenfunction of equation (5.33) corresponding to its lowest 
eigenvalue, — 4 ryd. It follows that the electrons must have opposite spins, 
and the two one-electron states may be written 

<M*) = <A0M0> <M*) = <A0)/?(C). 

The normalized total wave function, from equation (5.26), is thus 

_ 1 1 ) ( j ) 1(^2) 

T ° ~ ~r I 

v 2\4> 2 (x 1 ) 4> 2 (x 2 ) 1 

= 7! [<M*l)<M*2)-0l(* 2 )02(*l)] 

= ^ iK'iM' 2)[«(C 1 )^(C 2 )-a(C 2 )i9(C 1 )]. 


(5.34) 
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The energy of the ground state, given by this wave function, is 



<5.35) 


since the spin factor is. unity when and C 2 have opposite signs, and 
vanishes when they have the same sign. Thus, the energy in this case is the 
same as that given by the simple product wave function «K l 'i)<K , ' 2 )> which 
was used in § 3.6.1. This is always true, even for excited states of the atom, 
if both electrons are in the same orbital, but it is not true for excited states 
in general, as we shall see. 

Since the electrons must have opposite spins, states in which both electrons 
are in the same orbital can be described by a single determinantal wave 
function only, and are thus non-degenerate. When the determinant is 
expressed as a product of an orbital function and a spin function, as in 
equation (5.34), we see that the orbital part is symmetric in the positional 
coordinates, while the spin part is antisymmetric in the spin coordinates 
of the electrons. Non-degenerate states are known as singlets. 

(ii) Excited states. Let us now consider an excited state in which the two 
orbitals are different, the normalized wave functions 1 j/ a (r) and i// 6 (r) being 
eigenfunctions of equation (5.33) corresponding to the different eigenvalues 
s a and e b . In this case the electrons need not have opposite spins, so that 
there are four possible determinantal wave functions corresponding to the 
four ways of associating spin functions with the orbitals. These are as 
follows: 


^ = _L •AoOtMCi) ^ 0 ^ 2 )«(C 2 ) 

V 2 •Ai.O'iMCi) 2 ) a (C 2 ) 

= ~J= \- l l / a( r l) { l / b( r 2)~ l l / a( r 2) l l / b( r l)] a (Cl )^(C 2 )> 


= ^a(r i)0(Ci) I K(r 2 )K^2) 

’ V2 M *i)flf,) U'iMz) 

= 7! [»A^(»*i)«Afe(»‘2)—‘A a (*' 2 )'A6(»'i)] /5(c 1 )yS(C 2 X 
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= , Ao(>'i)a(Ci) '/'aOaMCa) 

V 2 'I'bir i)^(Ci) 

= 7f C^i)^(»‘2)a(C 1 ))8(C 2 )-^(r 2 )^(r 1 )a(C 2 )i?(C 1 )], 


^ = _L ^i)^(Ci) 

V 2 ^(^KCO ^(r 2 )a(C 2 )| 

= 7 | [*Aa(^" i) *A6(^*2) ®(C 2 )^(Ci) 0(^*2) 1)/^(C 2 )J • 


The wave functions are all normalized, since ij/ a and \]/ b are orthonormal. 
When interaction between the electrons is neglected — that is, for the 
unperturbed system — the four wave functions are degenerate, each of 
them corresponding to the unperturbed energy e a + e fc . Instead of using *F 3 
and f / 4 , therefore, we may construct linear combinations of them, as 
follows: 



These functions are no longer normalized, but become so if we multiply 
each by a further factor l/>/2, as may easily be verified. They have the 
advantage that the orbital and spin functions are separated, as in 
and W 2 > and a further advantage which we will now discuss. 

Since the unperturbed state is four-fold degenerate, the first-order 
perturbation energies must be obtained by setting up, and solving, the secular 
equation for the perturbation e 2 /r 12 or for the whole Hamiltonian H, 
as explained in § 3.7. Now, the fourth-order secular determinant may be 
constructed in terms of W lf W 2 , ^ 3 , and 1P 4 , and this will lead to the 
correct perturbed levels. If, however, we construct the secular determinant 
in terms of the normalized wave functions 



(5.36) 


as is perfectly permissible, we find that the off-diagonal elements all vanish, 
so that the perturbed energy levels are given immediately by the equation 



(5.37) 
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where W stands for any one of the functions (5.36). That this is so may 
be easily understood if it is noticed that the spin factors of these functions 
are orthogonal, for each off-diagonal element contains the integral of the 
product of two of them. 

The wave functions (5.36) are therefore the best linear combinations of 
the determinantal functions to represent approximately the perturbed states 
of the system. It may be seen that the functions 

•P., V 2 , and + ¥'„), 

all have the antisymmetric orbital factor 

while the remaining function 


has the symmetric orbital factor 

M t ‘i)'/ / b(r 2 ) + 'J' a ( r 2 )M r i)]- 

On the other hand, the spin factors of the first three functions are sym¬ 
metric in the spin coordinates, while the spin factor of the last function 
is antisymmetric, so that the four wave functions are antisymmetric in 
all the electronic coordinates, as required. 

Now, the normalized spin factors do not affect the energy values given 
by equation (5.37) — for example, 

f = i £ f f W(>',)«(r 2 )-«r 2 )«('' l )] 

X H i^a(t\)tb(r 2 )-^(r 2 )^b(>\)]HCi)\ 2 HQ\ 2 di^di-2 

x V'nO'i) <Aft(»‘2) - W'u)i)] d r 1 d r 2 ■ 

Thus, there are only two perturbed energy levels, corresponding to the 
symmetric and antisymmetric orbital functions respectively: 

e =ijy MVMMi 

x H OaO 1 ) i/'ftO'z) ± <A« 0'2 ) 1 )] d r 1 d r 2 . 


(5.38) 
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Using equation (5.32) and remembering that t J/ a and ij/ b are the ortho¬ 
normal eigenfunctions of equation (5.33), with eigenvalues e a and e b , 
we find 

E = fi - + 8 ‘+JJ f~ [l^)l 2 lW r 2)l 2 + l^r 2 )| 2 |^r 1 )| 2 

±{^*( r i)^t(r 2 )^a(r2)^b(r 1 ) + \l/Xri)H r 2)^(r 2 )\l/t(r 1 )}]dr 1 dr 2 . (5.39) 

The contributions to the integral from the first two terms in the square 
brackets are equal, and also the contributions from the two terms in the 
curly brackets are equal, for the terms of each pair have the same form — 
they differ only by the interchange of r x and r 2 , and this does not affect 
the value of the double integral. Hence, 

E = e a +e b + C±J, (5.40) 

where 

C = j j —Wair XI 2 \*J/ b (r 2 )\ 2 dr! dr 2 , (5.41) 

and 

J = \ {~- } Pa(>‘ 1 )\pt(>‘ 2 )\J/ a ( r 2)'Pb(ri)dr 1 dr 2 . (5.42) 

J J r 12 

The interpretation of C is straightforward: it is the energy due to the 
Coulomb interaction between the charge clouds of the electrons. This term 
would occur even if a simple product type of wave function were used. 
The integral J, however, appears only because an antisymmetric total wave 
function has been used. This antisymmetry ensures that the total charge 
density remains unchanged when the two electrons are exchanged, and J 
is called the exchange integral, the last term in (5.40) being the exchange 
energy. 

The integrand of C is always positive, so that the integral is positive. 
The integrand of J will in general be positive for some values of iq and r 2 
and negative for others. None the less, the integral J is positive, as may 
roughly be understood in the following way. 1 " When r t and r 2 are very 
nearly the same the integrand of J has almost the same value as that of C, 
which is then very large and positive owing to the small value of r i2 . On 
the other hand, if the integrand of J is negative, either i/'J/i) and t j/ a (r 2 ) 
or t/'(,(»*!) and $ b (r 2 ) must have opposite signs, which implies that r t and 


t We assume, for simplicity, that the wave functions are real. 
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r 2 are in general quite different in this case, so that r 12 is large and the 
integrand is small. The contributions to the integral from regions where 
the integrand is positive thus outweigh those from regions where the in¬ 
tegrand is negative, making the integral positive. 

The lower energy level is, therefore, 

E t = £ a + £f,-(-C— J. (5.43) 


and is triply degenerate — it is called a triplet. The three wave functions, 
all having the same antisymmetric orbital factor, are 

«C.)«Q ) . (5-44) 


-(y bHr 1 )<li b (r 2 )-<liX r Mb( r i)'i 


|^MC,)«Q + sc(C 2 )/i(C,)] 


the orbital and spin factors being separately normalized. These three states 
are, of course, degenerate only if there is no external magnetic field — we 
are consistently neglecting spin-orbit coupling. In the first state both 
electrons have positive spins, in the second both have negative spins, and 
the third is a linear combination of states in which the spins of the electrons 
are opposed, the corresponding values of the total z-component of the 
spin momentum being h, —h, 0. In other words, if a magnetic field H is 
switched on in the positive z-direction, the energies of the three states will 
be increased by 2 p B H, —2p B H and 0, respectively, so that the degeneracy 
will be removed. 

The higher energy level, 


E s — £ a + £{,-(- C + J, 


(5.45) 


is non-degenerate, and is therefore a singlet, as is the ground state. The wave 
function 


l'l'a(ri)'Pb(r 2 ) + '!'a(r2)'l'b(riy] 0(CiMC 2 )-a(C2)/?(Ci)] (5.46) 

has a symmetric orbital factor. The spins of the electrons are opposed, 
so that the total z-component of the spin momentum is zero, and the energy 
of the singlet state is unchanged in a magnetic field. 

It must be realized that i j/ a and i j/ b are eigenfunctions of a hydrogemc 
equation, and all eigenvalues e a , e b are degenerate, except the lowest. For 
any unperturbed level £ a + £ 6 , therefore, apart from the ground level, there 
will be a larger number of determinantal wave functions than we have con¬ 
sidered — we have only dealt with the possibilities arising from a particular 
choice of i j/ a and i J/ b . This is sufficient for our purpose, which was not to 



Ch. 5, § 7, 8] 


THE HYDROGEN MOLECULE 


127 


obtain all the energy levels of helium, but merely to give an example of the 
use of determinantal wave functions. When we come to calculate the energy 
of a metal we shall consider only the ground state, which is non-degenerate, 
so that there will be just one determinantal wave function to contend with. 

5.8. The Hydrogen Molecule 

The only examples we have considered so far have been systems containing 
a single centre of force the nucleus — apart from the interactions among 
the electrons themselves. Before going on to consider metallic systems, 
which contain very many atoms, it will be instructive to consider a simpler 
example, the hydrogen molecule, which consists of two protons and two 
electrons. It is not our intention, however, to give anything like a full 
account of the theory of molecular structure, of which there is a con¬ 
siderable literature. Admittedly, a metal may be regarded as a large molecule, 
but many of the methods which are applicable to small molecules are not 
appropriate to metals, and it would be merely confusing to describe them. 
None the less, the hydrogen molecule provides a simple illustration of 
the use of two different kinds of approximate orbitals, both of which are 
encountered in the theory of metals. 

In our treatment of a free atom we have assumed as a good approximation 
that the nucleus is at rest, but in a molecule the motion of the nuclei relative 
to one another gives rise to important effects which cannot be neglected. 
Fortunately, again owing to their much greater mass, the nuclei move very 
slowly compared with the electrons, and, as shown originally by Born and 
Oppenheimer, this makes it possible to deal with the electronic and nuclear 
motions separately. In our present case, we may solve the Schrodinger 
equation for the hydrogen molecule on the assumption that the protons are 
fixed at a distance R from each other, and so obtain the energy E, which 
does not include the kinetic energy of the protons, as a function of R. 
This energy E(R) may then be used as the potential energy function in a 
Schrodinger equation for the nuclear motion only. We will, however, 
confine ourselves to describing the calculation of E(R). 

If we assume that the protons are fixed at a distance R apart, and label 
the protons A, B and the electrons 1, 2, the Hamiltonian operator is 



the distances r Al , r B1 , etc., being those shown in Figure 5.2. 
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(i) Atomic orbitals. Let us first consider the case in which the internuclear 
distance R is very large, when the ground state 
of the system is that of two normal hydrogen 
atoms. The one-electron wave functions will 
then be hydrogen atom functions centred on 
the two protons, so that in the ground state 
they will both be Is functions of the form 
e _r/ "°, where r is the distance from the ap¬ 
propriate nucleus and a 0 is the Bohr radius. 

Thus, if we denote the two functions by \[/ A 
and i J/ B , and the position vectors of the two electrons with respect to some 
origin by r x and r 2 , we have 

Mr) = e “ rWao > Mi) = e~ rA2,a ° 

Mi) = e~ rBl/a °, Mi) = e -rB2/ao , 

These functions are not normalized, of course. 

The total wave function of the system may then be one of four determinants 
of the type 

>/o(»T)Xa(Ci) ^a^Xa^i) 

•/'bOt)Xb(Ci) 

where the spin functions Xa, Xb can be either a or /?. These functions are 
of the same form as those we obtained for the excited state of the helium 
atom. Also, if the hydrogen atoms are so far apart that the interaction 
between them is negligible, the four functions are degenerate, so that we 
may combine them linearly, again as for the helium atom, to obtain three 
functions which have the antisymmetric orbital factor 

= ^ A ( r i)Mi)-^ A ( r i)^i^i\ 
and one which has the symmetric orbital factor 

= ^ 1 )^ 2 ) +W'2)<Ab(>i)- 

The spin factors are such as to make the complete wave function antisym¬ 
metric in each case — they are given in equations (5.44) and (5.46). 

Of course, when R is so large that there is virtually no interaction between 
the two atoms, the calculation of the total energy presents no difficulty — 
it is just twice the ground state energy of the hydrogen atom. However, 


2 



Fig. 5.2. The hydrogen mole¬ 
cule. 
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these same wave functions 9 / + and may be used as a first approximation 
in calculating the energy for small values of R, when the interactions of 
the electrons with each other and with both nuclei have to be taken into 
account. This is known as the Heitler-London method, or the method of 
atomic or localized orbitals. 

W+ and now correspond to different energies, and these may be cal¬ 
culated in the usual way by means of the formula 


*. _S'P±H'P ± dT 
±= Jfidt 

It should be noted that the functions ij/ A and ij/ B are not orthogonal, so that 
the normalizing constant, which has not been included in < R+ and !£_, 
is not just 1 / 72 , as for helium, even when i j/ A and t j/ B are normalized, but 
is a function of the internuclear distance R. Equation (5.48) is, of course, 
correct whether the wave functions are normalized or not. 

Using the fact that both iJ/ A and i j/ B satisfy equations of the type 

(- —) ^(u) = #a(»r), 

\ 2m r A J 



where e is the ground state energy of a hydrogen atom, it is found after 
some manipulation that 


£± 


2e+ — + 

R 


Ei i + E 12 

C 2 + C 2 ’ 


(5.49) 


where 


Eu =|'Aa(» , i)d»'i = JViO’Od'i. 

S 12 = J^O^l)^!. 

En = e 2 f f ^1)^2) (-- 1 " —) dudr 2 , 

JJ w 12 Y A2 r BV 

£12 = c 2 f f il/ A (r 1 )tj/ A (r 2 )il/ B (r 1 )tJ/ B (r 2 ) - 1 -—) dr t (ir 2 . 

JJ 12 r A2 r BV 

All these integrals may be evaluated with a little difficulty, and the 
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resulting energy is shown as a function of R in Figure 5.3. We see that 
only E+(R), the energy of the 
symmetric state X F + , has a mini¬ 
mum, so that only this state gives 
rise to a stable molecule. In the 
antisymmetric state 'F_ the hy¬ 
drogen atoms repel each other 
at all distances. If we consider the 
spin functions which correspond 
to these orbital functions, as we 
did for the helium atom, we 
deduce that, if the electrons in two 
hydrogen atoms have parallel spins, 
the atoms will repel each other, 
whereas, if the spins are anti¬ 
parallel (opposite), the atoms will attract each other and form a stable 
molecule. The binding energy of the molecule, which is the depth of the 
minimum of E + below the energy of two free hydrogen atoms, is given by 
this method as 3.14 eV, compared with the experimental value 4.72 eV. 
The equilibrium internuclear distance is found to be 0.87 A, compared with 
the experimental value 0.74 A. 



Fig. 5.3. Energy curves for the hydrogen 
molecule (Heitler-London method). 


(ii) Molecular orbitals. Although the Heitler-London method is certainly 
correct at very large internuclear distances, it is equally certainly not correct 
at small internuclear distances, when the atomic wave functions centred 
on the two nuclei overlap considerably. An alternative method, which 
suggests itself in the latter circumstances, is to calculate the one-electron 
orbitals on the assumption that each electron is shared equally by both 
nuclei. That is to say, in the ground state, both electrons will occupy the 
same orbital but with opposite spins, as in the ground state of the helium 
atom, and this orbital, instead of being centred on a particular nucleus, 
will extend symmetrically around both (see Figure 5.4). Such orbitals are 
called molecular orbitals or extended orbitals or, in the electron theory of 
metals, Bloch orbitals. 

In principle, molecular orbitals may be calculated by the Hartree method. 
Each electron, in the present case, would be assumed to move in the field 
due to the nuclei and the average charge distribution of the other electron, 
and the procedure would be as described for the helium atom in § 4.4. In 
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practice, however, self-consistent field calculations are very difficult for 
molecules, owing to the lack of spherical symmetry of the potential function, 
so that generally approximate methods are used to construct molecular 
orbitals. The simplest approximate method is that known as the method 
of Linear Combinations of Atomic Orbitals, or LCAO, which we will now 
describe. 

In the immediate vicinity of each nucleus we should expect the molecular 
orbital to be not very different from what it would be if the nuclei were 
completely isolated from each other. In other words, in the vicinity of 
each nucleus, we should expect the molecular orbital to resemble an atomic 
orbital of the type discussed above. This suggests that we might obtain 
reasonable approximations to molecular orbitals by adding atomic orbitals 
centred on the different nuclei, with appropriate numerical coefficients — 
that is, by taking linear combinations of atomic orbitals. The numerical 
coefficients may be chosen by the variational method, as described in 
§3.4. For the ground state of the hydrogen molecule, however, considerations 
of symmetry permit only two possible molecular orbitals, which we will 
denote by i j/ s and t j/ a ; thus, 

<AsO) = •AaOH'/'bO). 

•AaO) = </'aO)- , /'bO)> 

where r can be either r t or r 2 — we assume that both electrons occupy the 
same orbital. 

Graphs of the functions \j/ A , ij/ B , \j/ s and ij/ a along a line through the 
nuclei are shown in Figure 5.4. The function \[/ s has no nodal surfaces, 
but \j/ a has a nodal plane through the mid-point of AB. We therefore expect 
\J/ S to correspond to the lower energy. Not only is this borne out by cal¬ 
culation, but it is found, in addition, that t j/ a does not even correspond 
to a stable molecule. We therefore reject \j/ a and accept t j/s as the correct 
molecular orbital within the LCAO approximation. Since both electrons 
occupy i/^, with opposite spins, the orbital part of the total wave function 
is simply 


V = </'sO'i)<As(>2) = [*Aa(o) b(g)] [*A . 4 (^* 2 ) T </* 3 (^* 2 )] • (5-50) 

This must, of course, be multiplied by an antisymmetric spin factor, just 
as in the ground state wave function of the helium atom given in equation 
(5.34). 
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The function W is not normalized, but, as before, the energy is given by 

eJ™*. (5.51) 

J W 2 dz 

We will not give details of the evaluation 
of E as a function of R, but merely state the 
results relating to its minimum value. The 
binding energy is found to be 2.68 eV, 
compared with the experimental value 4.72 
eV, and the equilibrium internuclear distance 
is found to be 0.85 A, compared with the 
experimental value 0.74 A. The binding 
energy given by the LCAO method thus 
compares unfavourably in the present case 
with that given by the Heitler-London 
method, which was 3.14 eV. However, the 
calculated binding energy can be considerably 
improved by using molecular orbitals better 
than those constructed by LCAO — the 
best value is 3.60 eV. 

It should be emphasized that the molecular orbital method is quite 
wrong at large internuclear distances. This may be seen by comparing 
the function ¥ with the Heitler-London function W +> which gives correct 
results at large internuclear distances. From equation (5.50) we obtain 

whereas 

V+ = 'J'A(r l )'i'B( r 2) + 'l'A(r2)'l'B( r l)- 

The functions differ in that contains two additional terms, > I / a( i 'i)^a( 1 ‘ 2 ) 
and iAB(r 2 ), which represent states in which both electrons are on the 
same nucleus. We know, however, that the ground state at large inter¬ 
nuclear distances is that in which one electron is on each nucleus, so it is 
clear that T will not give the correct energy. 

At very small internuclear distances, on the other hand, the molecular 
orbital method is superior. This is clear from the results quoted above, 
but may be seen without detailed calculation. If we take the extreme limiting 
case when R becomes zero and the two nuclei unite, then as far as the 
electronic wave functions are concerned we simply have a helium atom. 



Fig. 5.4. Wave functions for the 
hydrogen molecule, along a line 
through the nuclei A, B. xp s and 
V’a are molecular orbitals ob¬ 
tained by taking symmetric and 
antisymmetric linear combina¬ 
tions, respectively, of the Heit¬ 
ler-London functions xp A and xp B . 
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The Heitler-London method, however, will still give the one-electron func¬ 
tions as hydrogen atom functions, whereas the molecular orbital method 
(although not the LCAO approximation) will give the proper helium atom 
functions. 

It should be pointed out that the Heitler-London method can itself be 
improved — for example, by using orbitals of the atomic type which are 
not hydrogen atom functions but, let us say, of the form e -cr , the parameter 
c being calculated as a function of R by the variational method. In this 
way results can be obtained which are slightly better than those given by 
the best molecular orbital calculation. Of course, both the Heitler-London 
method and the method of molecular orbitals can be used as the basis of 
perturbation calculations in order to improve the results still further. In 
general, one cannot say that either method is correct, but one or the other 
will give better results in given circumstances and will be the more con¬ 
venient as a basis for more accurate calculations. In dealing with metals 
it is found that molecular orbitals are more convenient, and we shall use 
these exclusively in the following work — in Chapter 8 we shall again 
meet the LCAO approximation, which is known in the theory of metals as 
the approximation of tight binding. 

The difference between the Heitler-London and molecular orbital methods 
lies in their different treatments of correlation between the motions of the 
electrons, a topic already mentioned in § 4.4 in connection with the Hartree 
method and one which will occupy us a good deal later in the book. The 
Heitler-London method includes a large degree of correlation in that the 
electrons are kept apart on separate nuclei, whereas in the molecular 
orbital method, as will be seen in the following chapter, the only correlation 
which appears is due to the Pauli principle, and this does not affect the 
energy of the hydrogen molecule since there are only two electrons and 
they may therefore occupy the same orbital, provided they have opposite 
spins. The reason why both methods give rather inaccurate results is because 
neither takes proper account of correlation. 

In conclusion, it must be remarked that, owing to the relative simplicity 
of the hydrogen molecule, it has been found possible to apply the variational 
method directly, with analytical trial functions, of great complexity, which 
include the effects of correlation, and results have been obtained which 
agree almost exactly with experiment. This method is not applicable to more 
complicated systems, however, which is why we have concentrated on the 
approximate methods described above. 


CHAPTER 6 


METALLIC COHESION AND THE HARTREE-FOCK 

METHOD 


6.1. The Cohesive Energy of a Metal 

Solid metals have a crystalline structure. That is to say, the nuclei of 
the atoms which constitute a metal are arranged on a space lattice or crystal 
lattice — in other words, they are arranged in a regular spatial pattern 
which repeats itself throughout the volume of the metal. As a matter of 
fact, a lump of metal of macroscopic dimensions generally consists of a 
mass of tiny crystals tightly packed together, or at least has irregularities 
in its crystal structure. The mechanical properties of metals, from an 
engineering point of view, depend strongly upon these crystal imperfections, 
but for most theoretical purposes they are not important and it is sufficient 
to consider a metal as a perfect crystal. We shall describe the most important 
crystal structures of metals in Chapter 8. 

Of course, metals are not the only crystalline solids. Indeed, we may say 
that all true solids are crystalline — so-called amorphous, or non-crystalline, 
solids, such as glasses, are more properly regarded as highly viscous liquids. 
The structure of non-metallic solids, however, particularly when more 
than one kind of atoms are present, may be more complicated than those 
of metals. The discussion of the present chapter can be applied, with slight 
modifications, to all solids. The true distinction between metals and other 
solids will appear later. 

The nuclei are never static, even at the absolute zero of temperature, but 
undergo a vibrational motion. Owing to the relatively large masses of the 
nuclei, however, this vibrational motion is very slight, and as far as energy 
calculations go, particularly at low temperatures, it can generally be neglected. 
Only in a very accurate calculation of the energy of a fight metal, such as 
lithium, is the vibrational energy of the nuclei of any significance. In all 
our future work, except where otherwise stated, we will assume that the 
temperature is absolute zero, and will ignore the zero-point vibrations of 
the nuclei. 
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The first step in calculating the energy of a metal is to set up the Hamil¬ 
tonian operator for the system of nuclei and electrons, just as we have done 
for single atoms. This time, however, we have a large number of atoms — 
about 10 23 per cubic centimetre. Let us say that the metal is composed 
of N atoms, each with Z electrons, so that there are ZN electrons, each 
with charge —e, and N nuclei, each with charge Ze. 

As before we will neglect magnetic interactions, as they are of a very 
much smaller order of magnitude than the electrostatic interactions. The 
Hamiltonian operator is then 




ZN 

= -I 


ZN 


i = i 2m 


v, 2 -I 


N 

z 


Ze 2 


i=i fl =i \R a -r t \ 


ZN ZN 

+iZI 

i±j 


N N 

+*II 


zV 


a*b \R a -R b \ 


( 6 . 1 ) 


where r t is the position vector of electron i, and R a is that of nucleus a. 
The first term represents the kinetic energy of the electrons; the second is 
the potential energy of the electrons in the field of the nuclei; the third 
is the potential energy due to the Coulomb interaction of the electrons; 
and the last term is the potential energy due to the Coulomb interaction 
of the nuclei. 

Although we shall be specifically concerned with metals, this basic 
Hamiltonian is, in fact, the same for all solids containing only one kind of 
atoms, and is only slightly more complicated if several kinds of atoms are 
present. No particular metallic properties have so far been required, although 
we assume that the crystal structure of the metal we are dealing with has 
already been incorporated in the positioning of the nuclei. The ideal procedure 
would be, of course, to assume random positions for the nuclei at first and 
then show that the minimum ground state energy is obtained when they are 
arranged on the correct crystal lattice. However, different lattice structures 
have very similar energies, so that, in view of the somewhat crude approxima¬ 
tions which have to be made, such a calculation is at present hardly feasible. 

We shall mainly be interested in the ground state energy, which is simply 
the lowest eigenvalue of the Schrodinger equation 

JT‘T = (6.2) 


The Hamiltonian depends upon the internuclear distance, and con¬ 
sequently so do E and W. However, if the internuclear distance is fixed, 
then W may in principle be determined as a function of the electronic 
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coordinates, both space and spin, and the corresponding energy found 
A different internuclear distance may now be chosen and the calculation 
repeated, and so on. In this way we can obtain the energy of the solid as a 
function of internuclear distance. The equilibrium internuclear distance 
will then clearly be that which gives the minimum ground state energy. 

Perhaps it should be mentioned that here the zero of energy is taken 
to be that of the nuclei and electrons when widely dispersed and at rest, 
so that their kinetic energy is zero and their interactions negligible. The 
zero of energy is arbitrary, however, and later we shall choose a more 
convenient one. 

We have remarked that a cubic centimetre of meial contains something 
like 10 23 nuclei and Z times this number of electrons, so it is quite clear 
that the Schrodinger equation cannot be solved directly, and, not only 
this, but even a simple approximate method like that of Hartree, which is 
so successful in the case of isolated atoms, cannot be applied directly. 
None the less, although drastic approximations must be made, certain 
simplifying features of the electronic wave functions in a metal, which are 
not present in isolated atoms and molecules, sometimes permit quite ac¬ 
curate numerical results to be obtained. 

We have been speaking so far of the total energy of the electrons and 
nuclei in the metal, but, in fact, we are usually not interested in this, but only 
in the amount by which it differs from the energy of a system of free atoms. 
This difference is called the cohesive energy of the solid, because it is a measure 
of the strength of the forces which bind the atoms together in the solid 
state a stable solid exists only because its energy is lower than that of a 
system of free atoms at the same temperature. We will define the cohesive 
energy formally as follows: 

the cohesive energy of a metal is the energy required to dissociate a given 
mass of the solid metal, at the absolute zero of temperature , into free atoms. 
In other words, it is the amount of heat which must be supplied to the metal, 
initially at 0°K, in order to vaporize it. If the effect of the surface of the 
metal is neglected, as it may be for most theoretical purposes, the energy 
is proportional to the number of atoms present. A convenient unit of mass 
to use is thus the gram-atomic weight or, what is the same thing for a metal, 
the gram-molecular weight, or mole, and cohesive energies are often quoted 
in kilo-calories per mole. It is sometimes more convenient to express the 
cohesive energy in electron-volts or rydbergs per atom, however. 

If we are only interested in the cohesive energy, it is possible to recast 
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the Hamiltonian (6.1) in a slightly simpler form, which takes into account 
the fact that some of the electrons have practically the same energy in the 
metal as they have in a free atom. 

6.1.1. REFORMULATION OF THE HAMILTONIAN 

We will now specifically consider a monovalent metal, although the 
general analysis is no more complicated for a metal of any valency. The 




(b) 


Fig. 6.1. (a) Radial charge densities of the K and L shells and of the 3s electron 
in a free atom of sodium, (b) Schematic representation of the charge clouds of 
the ion-cores (shaded circles) and of the valence electrons of two sodium atoms 
whose internuclear distance (3.67 A) is that of nearest neighbouring atoms in 

the solid metal. 

ideal metal from our point of view is sodium , for reasons which will become 
apparent — it is by no means an ideal metal from an experimental point 
of view. 
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The ground state configuration of a normal sodium atom is (1 s) 2 (2s) 2 (2p) 6 (3s) 
— that is, one 3s electron and closed K and L shells. The Hartree method 
has been applied to a sodium atom, and the radial charge densities in the 
various shells are found to be as shown in Figure 6.1 (a). The radial charge 
density of a given shell is defined as 4 nr 2 p(r), where p{r) is the total charge 
density of the electrons in the shell at a point distant r from the nucleus — 
the integral of the radial charge density with respect to r, from r = 0 to 
infinity, is thus the total charge in the shell. 

The K and L shells are confined almost entirely within a radius of 1 A 
from the nucleus, whereas the effective radius of the charge cloud of the 
3s electron is several angstroms — of course, in principle, all charge clouds 
extend to infinity. The chemical properties of sodium depend mainly upon 
the 3s electron, which is called the valence electron. In general, for any 
metal, the valence electrons may be thought of loosely as those whose 
charge clouds are most widespread in a free atom. They are also known 
as conduction electrons because they are largely responsible for the electrical 
conductivity of the solid metal. The nucleus and all the electrons except 
the valence electrons are together referred to as the ion-core of the atom. 

Figure 6.1(b) shows schematically the charge clouds of two sodium 
atoms whose internuclear distance (3.67 A) is equal to that of nearest neigh¬ 
bouring atoms in the solid metal. We see that the shaded circles, which 
represent the ion-cores, are well separated, whereas the large circles, which 
represent the valence electron distributions, overlap considerably. It is 
therefore reasonable to suppose that, when a cloud of free atoms condenses 
to form a solid, the K and L shells do not alter significantly, but that the 
wave functions of the valence electrons undergo a great change. In fact, 
the atomic wave functions of the 3s electrons on neighbouring atoms of 
the solid overlap so much that it is no longer appropriate to regard a 
valence electron as being associated with a particular atom, but its wave 
function must extend throughout the metal, and it must be associated 
equally with all the atoms. A metal may, in fact, be regarded as a lattice 
of ion-cores, together with a gas of valence electrons which may move 
more or less freely through the lattice. 

A useful quantity to bear in mind when considering the possible overlap 
of the ion-cores in the metal is the so-called atomic radius r s . This is 
defined by 



(6.3) 
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where v is the volume of the metal. In other words, it is the radius of a 
sphere whose volume is the mean volume per atom. If the effective radius 
of the ion-core is small compared with r s , we may assume that the ion- 
core of an atom is virtually unchanged in the solid state. This is more or 
less true of the alkali metals, the alkaline earth metals, and aluminium. 
It is not true, for example, of copper — in this metal the 3d sub-shells of 
neighbouring ion-cores overlap significantly, and this must be taken into 
account in cohesive energy calculations. The atomic radius of sodium is 
2.1 A, as shown in Figure 6.1(a) — it is twice as great as the effective 
radius of the ion-core. 

The cohesion of metallic sodium is thus due essentially to the redistribu¬ 
tion of the valence electrons in the solid state. If we imagine a system of 
free atoms (arranged on a widely spaced lattice, for simplicity), then, as the 
internuclear distance is reduced, or the lattice is contracted, the valence 
electron wave functions begin to overlap and, as shown in Fig. 6.2, the 
energy begins to fall. This is due mainly to a lowering of the potential 
energy owing to the spreading of the 
wave functions over several atoms. 

However, the kinetic energy begins 
to increase t, so that a minimum in 
the energy curve is eventually reached, 
and at smaller internuclear distances 
the energy increases rapidly. The 
minimum in the energy curve cor¬ 
responds to the equilibrium state of 
the solid, and the depth of the mi¬ 
nimum below the energy of the free 
atoms is the cohesive energy of the 
metal. 

Since the ion-cores are to be 
regarded as being the same in the metal as in a free atom, the problem 
of calculating the cohesive energy may be slightly simplified by treating the 
system as one of valence electrons moving in a fixed lattice of ion-cores. 
Thus, if V a (rt) is the potential energy of electron i in the field of ion-core a, 



Fig. 6.2. Energy of a metal as a function 
of internuclear distance. The zero of 
energy here is the energy of a system of 
free atoms. S is the cohesive energy of 
the solid. 


t We have seen in § 2.3, equation (2.39), that for a particle in a box the kinetic energy 
increases as the size of the box decreases, and it is generally true that a particle has a 
larger Kinetic energy the smaller the region in which it is confined. 
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the Hamiltonian operator becomes 



We have used the more compact notation r ;; instead of l^ — Vj\ and R ab 
instead of \R a —R b \. 

The last term in (6.4) represents the energy of interaction of the ion- 
cores: its simple form rests on the assumption that the ion-cores are far 
enough apart for their interaction to be considered as that of spheres 
each with a single positive charge e, the remaining charge of the nucleus 
being screened by the core electrons. This, of course, only applies to mono¬ 
valent metals — for divalent metals the ion-cores would have charge 2e, 
so that the last term in (6.4) would have to be multiplied by four, and 
so on. 

It should be noted that the zero of energy is now that of a system of 
widely separated ion-cores and valence electrons at rest — this differs from 
the energy of a system of widely separated neutral atoms, the difference 
per atom being just I, the first ionization energy of a free atom, that is, 
the energy required to remove one electron from the neutral atom in its 
ground state. 

Clearly, the Hamiltonian (6.4) is not simplified to such an extent that 
we can dispense with the one-electron approximation, and indeed we must 
still be content with some relatively crude form of this. For the time being, 
however, we will not go into details of how to obtain the one-electron 
functions, but will assume that the N functions which give the lowest energy 
consistent with the Pauli principle have been obtained, and will denote 
them by 


• • • ^nC^JvX 


including spin. We will also assume, for simplicity, that these functions 
are orthogonal and normalized, so that 



(6.5) 


The effective extent of the wave functions beyond the surface of the metal 
is negligible, and so we will take all integrals throughout the volume of the 
metal, rather than throughout all space — we will later adopt boundary 
conditions consistent with this procedure. 
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6.1.2. SINGLE PRODUCT WAVE FUNCTION 

First of all, and solely for purposes of comparison with our later results, 
we will calculate the total energy using a wave function consisting of a 
single product of one-electron functions, that is, 

= 0iC*i) 02 (^ 2 ) • • • 0jvC*jv)- (6.6) 

Owing to the normalization of the <f> lt T 0 is also normalized, and the 
energy is given by 

E 0 = j'F*H'P 0 dt', 


using the notation introduced in equation (5.27). Thus, with the Hamil¬ 
tonian (6.4), we have 


E 0 = i n 


f, 2 N 

n *-72 


— Vf+I VXr,) 

2 111 a 




N N r e 2 N N 2 

+ill (6.8) 

i*j J r u a*b R ab 


the last term being unchanged as it does not depend upon the electronic 
coordinates. Again, owing to the normalization of the 0 f , this becomes 


N c r h 2 N 1 

£.=zUw -f-V?+LF„(r,) 

1 J L 2 m a 


0i(*;)d*; 


+ i ZZ jj— \4>i(Xi)\ 2 \<t>j(Xj)\ 2 dXidXj 


N N 2 

+iZZf 

a * b E ab 


(6.9) 


According to the notation of § 5.3, we write 

4>i(x) = 


( 6 . 10 ) 


where the spin function Xi may be either a or /?. We may then sum over 
the spin variables immediately, obtaining 
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( 6 . 11 ) 


Finally, since the value of a definite integral is independent of the symbols 
used for the variables of integration, we may use, for example, r instead 
of r { in the first term, and r x and r 2 instead of r t and fj in the second; thus, 


E JVfM [- ~ + Z v X r ) 'H r ) Ar 




( 6 . 12 ) 


We do not expect the energy given by this expression to be very accurate, 
because the wave function (6.6) is not properly antisymmetric. We shall 
later calculate the error involved for the idealized case of a system of free 
electrons. Meanwhile, let us determine the equivalent expression using a 
determinantal wave function. 

6.1.3. DETERMINANTAL WAVE FUNCTION 

Using the same one-electron functions as in the previous section, we now 
construct the normalized, antisymmetric total wave function 



which may be written in the determinantal form (5.24). 

The Hamiltonian is a symmetric operator, for it is unchanged under 
any permutation of the electronic coordinates, so that in evaluating the 
total energy E we may make use of the theorem expressed in equation 
(5.28). Thus, 
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Because of the orthogonality and normalization of the 4> h this ex 
pression becomes 




(6.15) 


In obtaining the first term, only the identity operator in the expression 
for that which leaves the electronic coordinates unchanged — gives 

a non-zero result. All the other permutations give rise to factors like 



which are zero by (6.5). In obtaining the second term, however, both the 
identity operator and the operator which interchanges x t and x Jt leaving 
the other coordinates unchanged, give non-zero results — this is due to 
the factor e 2 /r u in the integrand. For the latter operator, p = 1, which 
accounts for the minus sign before the second term in the square brackets 
of the double integral. 

Expanding the second term in (6.15), we obtain 





a * b R-ab (6.16) 
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Notice that this differs from expression (6.9) for E 0 only in the presence 
of the third term, which is called the exchange energy, because it is due to 
the exchange of electronic coordinates in the various terms of the expanded 
determinantal wave function [see equation (5.40) et seq. ]. The second term, 
which represents the interaction of the charge clouds of the electrons, may 
be called the Coulomb energy. 

As in the case of equation (6.9), the summation over the spin variables 
can be performed immediately for the first two terms in (6.16), the result 
being merely to replace <j> t by i/q and x t - by r t . The exchange energy must 
be considered in more detail. 

First, if 4>i and (pj have the same spin, their spin functions both being a, 
say, the integral in the exchange term becomes 


X (T— <A*(»y)<A*(X)’/'-•O'-) 'i'j(r J )(ir i drj\(x (( ; )\ 2 |oc({ y )| : 

a,tj=±iJJ r u 



<A. ? ( r j) tj (**.) M r i) tf'X'V) <Mr y . 


The same result follows if both spin functions are /?. 

Second, if < p f and <pj have opposite spins, the integral becomes 


X f f — 'P?( r j)'l'*( r i)'l / i( r ;)'l / j( r j) dr i dr j <x(tj)P(CMOP(tj) = 0- 

Cii Cj — i 1 ^ Fi; 


The exchange energy is, therefore, 


jv N r r 

-*^/J 

(II spins) 


<A* ( r j) ( r i) l l / j( r j) dr i d rj 

= -2 XX ff— 'l't( r i)'l'j( r 2)'l'i(r 2 )ipj(r 1 )dr l dr 2 , 

iitj jj r l2 


spins) 


the double sum being over those functions t/q and i p ; associated with the 
same spin functions, that is, over electrons with parallel spins only. 

The expression for the total energy in terms of orbital functions only is, 
thus, 
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i*j JJ r 12 




(6.17) 


using the same notation as in (6.12). It should be noticed that, in deriving 
this expression for the energy, no assumption has been made about the 
form of the functions ii/q — the expression is the same whether these are of 
localized (Heitler-London) type or of extended (Bloch) type, as discussed 
in § 5.8. We shall, in fact, use functions of the latter type, but we shall leave 
their description and the method of calculating them until later chapters 
— it will suffice for our present purposes to say that we assume there is 
the same chance of finding a given electron in the neighbourhood of any 
ion in the metal. 

If i is put equal to j, the integrands in the second and third terms of 

(6.17) become the same. Since these terms have opposite signs, therefore, 
we can omit the condition i ^ j from each of them. Changing the variable 
of integration from r to in the first term which will prove convenient in 
the following work, we have, finally, 





(6.18) 


This expression has been obtained, of course, for a monovalent metal. 
If the valency were q, say, so that the total number of valence electrons 
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were Nq, the sums over i and j would be from one to Nq, and the last 
term would be multiplied by q 2 . 

6.2. The Hartree-Fock Method 

We have seen in Chapter 4 that, according to the variation principle, 
Hartree’s equations give the best one-electron functions to be used in a 
single-product type of wave function for the ground state. This is not so 
when a determinantal type of wave function is used. Application of the 
variation principle using a determinantal wave function results in equations 
for the one-electron functions i (/ t which are similar to those of Hartree but 
which contain an additional term — the ‘exchange’ term. These equations 
are called the Fock, or Hartree-Fock, equations. 

We will not apply the variational method directly, as this is a somewhat 
complicated process, but will attempt to give a plausible derivation of 
Fock’s equations by a method similar to that used in deriving Hartree’s 
equations in § 4.6. The only terms in the expression (6.18) which depend 
upon the particular function i/q are the following 




(spin j 
= spin i) 


The factors \ have disappeared because the number i occurs in both sums 
of the double summations. 

The expression (6.19) may be written in the form 



( 6 . 20 ) 


where the operator H x is given by 



( 6 . 21 ) 


= sp i n i) 


as may immediately be verified by substitution. 
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For the ground state we wish to choose those one-electron functions 
which make the energy E an absolute minimum. Since the only terms of E 
which contain i/q are represented by (6.20), we must choose the function 
1 \>i which minimizes this expression. The variation principle tells us that 
the required i/q is the eigenfunction of the equation 


H i^i = 

with its lowest eigenvalue e t . This is the Fock equation for the function 
t/q. Written in full the equation is 



a 






dr. 


12 


— e 


£ f i A*( > '2)<A,<>'2)<AX>'i) 

J . LJ f*12«Ai(»'l) 


(spin j 
= spin 0 


1) 

2 'I'iir i) = £,«/' < (»‘i) • 


( 6 . 22 ) 


The operator in the first square brackets is just the Hartree operator, 
and may be compared with the operator in equation (4.29) for a single 
atom — the fact that j = i is not excluded from the third term is of no 
importance in a metallic system, since the omission of one electron in 10 23 
can hardly have an appreciable effect upon the potential. This is, of course, 
only provided that the one-electron wave functions are extended ones, 
so that the removal of a single electron causes only a negligible change 
in the average charge density at any point of the metal. The remaining 
term on the left-hand side of the equation, the exchange term, may equally 
well be written 



(spin j 
= spin i) 


r l2 



<h(r 0 - 


(6.23) 


There is an equation of the same form for each of the different one-electron 
functions, and these equations must be solved simultaneously. For a single 
atom this can be done by a method of successive approximations, until 
self-consistency of the required degree of accuracy is reached, as was 
described for the Hartree equations. In metals the problem is too complicated 
and a cruder approximate form of the equations must be used. In the 
idealized case of a free-electron gas, however, which is of some interest in 
metal theory, the equations can be solved analytically, and we shall do this 
in the following chapter. 
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Of course, although we have said that \ i/q is the eigenfunction of equation 
(6.22) corresponding to its lowest eigenvalue , in practice this must be modified 
to its lowest eigenvalue permitted by the Pauli principle. We know that, 
according to the Pauli principle, not more than two electrons may be in 
a given orbital state, and these must have opposite spins; a fact which 
must be borne in mind when solving the equations. In the ground state 
of a non-ferromagnetic metal at 0°K, which is the only case we shall consider 
in detail, all the orbital states of lowest energy are occupied by two electrons. 
In a ferromagnetic metal, on the other hand, the state of lowest energy is 
one in which there are more electrons with one kind of spin than there are 
with the other, so that some orbital states are occupied by only one electron. 

Although equation (6.22) is correct, it must not be pretended that our 
derivation of it has been rigorous. The principal objection to it is that the 
operator itself depends upon i j/ l} so that it is not obvious that the variation 
principle can be applied directly to (6.20) as we have done. None the less, 
this derivation is sufficient for our purposes, being more informative than 
a mere statement of the equation, yet avoiding complicated calculations 
which lie outside the scheme of this book. 

It is easily seen that the total energy of the electrons is not simply the sum 
of the Fock eigenvalues £ ; . If we use equation (6.22) in conjunction with 
(6.18), we find immediately 



(II spins) 



(6.24) 


None the less, the eigenvalue £ ; has a simple interpretation for a metallic 
system when the one-electron functions are of the extended type: it is the 
negative of the energy required to remove from the solid an electron in the 
state i (/{. As we have already mentioned, if the average charge distribution 
of every electron is spread throughout the metal, and there are something 
like 10 ' electrons, then the removal of a single one of them will hardly 
affect the Fock Hamiltonian i/ ( .. We may thus assume that the functions 
*Pt ^e virtually unchanged after the removal of a single electron. 
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Now we have seen that a given \[/- t only appears in the expression for E 
through the terms shown in (6.19). If an electron in the state i j/ t is removed 
from the metal, then, the total energy must be reduced by the value of 
the expression (6.19), namely, e t . The energy which must be supplied in 
order to remove the electron is therefore — e i} as we wished to prove. This 
result is generally known as Koopmans’s theorem. 


6.3. Coulomb Correlations and the Fermi Hole 


Electrons repel one another, so that they do not move independently 
but in such a way as to avoid each other as far as possible. In other words, 
the effect of the Coulomb force is to couple, or correlate, the individual 
electronic motions so as to reduce the probability of two electrons closely 
approaching each other. Such correlations among the electronic motions, 
or positions, may be called Coulomb correlations. 

In the Hartree method Coulomb correlations are completely ignored, 
each electron being supposed to move in the average charge distribution of 
the other electrons. The total wave function is a single product of one- 
electron functions, so that the probability of a given configuration depends 
only upon the one-electron functions and not directly upon the distances 
between pairs of electrons. 

The Hartree-Fock method again neglects proper Coulomb correlations, 
as defined above, but includes correlations of another sort. These are 
correlations among the positions of electrons with parallel spins only, 
and are due, not to the Coulomb force, but to the Pauli principle, as 
embodied in the use of a determinantal wave function. The latter is of 
the form 


V = 


4> i(*i) <f>i(x 2 ) ■ • • 

<f>zM </> 2 (*2) • • • <t> l(%) 


0/v( X 2) • • • $w( x jv) 


where it must be remembered that 


<f>(x) = iKO x(0, 

the spin function x being either a or p. It can easily be seen that with such 
a total wave function there is zero probability of two electrons with parallel 
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spins + being in the same place. For example, suppose that electrons 1 and 
2 have parallel spins, so that £i = ( 2 , and are in the same place, so that 
r i = r 2 - This implies that = x 2 , and the first two columns of the deter¬ 
minant are the same, so that the value of the determinant, and consequently 
the probability density |!F| 2 , is zero. If the electrons have anti-parallel spins, 
that is, if Ci # ( 2 , this does not apply, for now x t # x 2 even when r 2 = r 2 . 

The use of a determinantal wave function not only implies zero probability 
of finding two electrons with parallel spins in the same place, but also that 
there is only a small probability of finding two such electrons within a short 
distance of each other. It is as if, during its motion, each electron were 
surrounded by a ‘hole’ in the distribution of electrons with parallel spins. 
This is called the exchange hole or Fermi hole. We shall see in the following 
chapter, after discussing the free-electron approximation, that the Fermi 
hole may be regarded roughly as a sphere of radius r s , but meanwhile 
the idea can be made plausible by means of the Fock equation (6.22). It 
must be remarked in advance that, in spite of the footnote on this page, 
we will at first assume that the electron at position r x is in the particular 
one-electron state 0* — this is because we shall be arguing by analogy 
with the Hartree method, for which this is a permissible statement. 

The exchange term in equation (6.22) may be written 




where 


Mri) = 


* r 

? j 

(spin j 
= spini) 




12 


l A;( r i) 


dr, 


Just as the Hartree potential energy of the electron at r 1} namely, 



is that due to a charge distribution whose density at r 2 is 

Wofo)! 2 . 

j 


t The meaning of this is that the two electrons have the same spin directions, or coor¬ 
dinates, not that they are in specific one-electron states having the same spin functions. 
When a determinantal wave function is used it is not strictly correct to think of any 
given electron as being in a particular one-electron state, but each electron must be 
regarded as partially in all of the states. 
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so the function A i (r 1 ) may be interpreted as the potential energy of the 
electron at I*! due to a fictitious exchange charge distribution, whose density 
at 1*2 is 

e y 'Pj( r 2)'l'i(r 2 )'l'j(ri) 

, J. U'i) 

(spin ; 

= spin i) 

The total exchange charge is 

* £ f ^ = e 

, J . J >/',•(»' i ) 

(spin; 

= spin i) 

since 

J 'l'*( r 2)'l'i(r 2 )dr2 = 6 U , 

owing to the orthogonahty and normalization of the one-electron functions 
— see equation (6.5). This is a positive charge equal to that of a proton. 

Furthermore, the exchange charge density at r 1} obtained by putting 
r 2 = r l in (6.25), is 

* I \Uri)\ 2 > (6-26) 

J 

(spin; 

= spin i) 

which is equal in magnitude, but opposite in sign, to the average charge 
density of electrons with spins parallel to that of the electron at — 
this is supposed to be in the orbital state i J/ t and some spin state x> and 
the sum is over all the orbitals associated with the same function x■ Since 
the total exchange charge is only e, the exchange charge density must 
decrease rapidly from the value (6.26) as the distance r 12 increases. 

Apart from the exchange term, the potential energy appearing in the Fock 
equation (6.22) is that of an electron in the field of the ion-cores and the 
charge cloud of all the valence electrons. The foregoing discussion suggests 
that, in addition, and due to the exchange term, the electron may be regarded 
as surrounded by a positive charge cloud of total charge e and of limited 
extent. This effectively cancels the average charge distribution of electrons 
with parallel spins in the neighbourhood of the given electron and gives 
rise to the Fermi hole described above. 

It should be pointed out that, in general, the exchange charge density 
as defined by (6.25) is not spherically symmetrical about r t ; it is not the 
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same for different i/^; and it need not be a real function — as we shall 
see in the case of free electrons. It would, in fact, be more realistic, particular¬ 
ly in view of the footnote on p. 150, to average the exchange charge density 
over all the t/^-. When we do this for free electrons, in § 7.6, we shall obtain 
an average Fermi hole which is spherically symmetrical and which otherwise 
accords with the description given. 

Although these parallel spin correlations which appear in the Hartree- 
Fock theory are not due to the Coulomb interaction, they affect the energy 
in the same way as Coulomb correlations. The total energy given by the 
Hartree-Fock method is thus an improvement on that of the Hartree 
method, which neglects all correlations — the principal difference is due 
to the appearance of the exchange energy, which we shall find to be large 
and negative, in the Hartree-Fock expression. Antiparallel spin correlations 
are still completely neglected, of course, and we shall see that this has some 
unfortunate consequences. It might be expected that, in a more accurate 
theory, each electron would be surrounded by a hole, similar to the Fermi 
hole, in the distribution of electrons with anti parallel spins also. We shall, 
in fact, make use of this concept of a correlation hole in the distribution of 
all electrons in Chapter 9, leaving its justification until Chapter 10. 


CHAPTER 7 


THE FREE-ELECTRON APPROXIMATION 



7.1. The Periodic Field in a Metal 

The total charge distribution of a closed shell of electrons in a free atom 
is spherically symmetrical about the nucleus, and it follows that the potential 
energy of a valence electron in the field of the ion-core is also a spherically 
symmetrical function. This function V(r) for a sodium atom is compared 
with the corresponding function — e 2 /r for a hydrogen atom in Figure 7.1 
— r being the distance from the nucleus. The /-(A) 

nuclear charge of sodium is lie, so that at points 
very close to the nucleus the function V(r) is 
practically -11 e 2 /r, the electrons there having 
negligible effect. At points lying outside the 
effective radius of the ion-core, however, V(r) 
is very nearly — e 2 /r, because the field of the 
nucleus is screened by the ten core electrons so 
that it becomes virtually the field of a single 
proton. At intermediate points the function 
cannot be estimated in a simple manner. 

The potential energy of an electron in a solid 
metal, of course, is not obtained simply by 
summing such potential energy functions, one for each ion in the metal, 
even when we assume that the structure of the core remains unchanged 
in the solid state. The interactions of the valence electrons among them¬ 
selves must be taken into account. The potential energy of any electron 
due to the average charge distribution of the valence electron gas may 
be found by the Hartree method. However, if more accurate account is 
to be taken of electronic correlations, as discussed in § 6.3, then the Hartree- 
Fock method or modifications of it must be used. We shall consider this 
problem in more detail later. Whatever method is used, the effect of the 
other electrons is such as to screen a given electron from the field of all 


Fig. 7.1. Potential energy of 
the valence electron in a free 
atom of sodium, compared 
with that in a hydrogen 
atom (broken curve). 
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the ion-cores except those in its immediate vicinity, so that the potential 
energy is numerically much less than it would be if only the ion-cores 
were present. 

The potential energy of a valence electron as a function of position 
along a line of nuclei in the metal is 
shown schematically in Figure 7.2 (a). 

Along a parallel line, which does not 
pass through any nuclei, the function 
would be as shown in Figure 7.2 (b). 

There is a potential barrier at the 
surface, of course, since the electrons 
cannot pass freely out of the metal. 

However, except in a few atomic 
layers at the surface, the potential energy 
is an almost exactly periodic function 
with the period of the lattice. Since, in a 
cubic centimetre of metal, only about one atom in a million is at the surface, 
the effect of the surface is unimportant when calculating bulk properties 
of the metal, such as the cohesive energy. We therefore assume that the 
electrostatic field in which a valence electron moves in a metal is strictly 
periodic. 

7.2. The Sommerfeld Model 

The earliest application of quantum mechanics to the electrons in a metal 
was made in 1928 by Sommerfeld, who simplified the problem by taking 
the potential energy of each electron to be constant within the metal. That 
is to say, he assumed that the total field of the electrons and ion-cores 
at any point in the metal could be taken to be zero as a good approximation. 
Although this model seems to be a highly idealized one, we shall see that 
it is not unreasonable for the calculation of at least some of the physical 
properties of a number of metals. 

The value assumed for the constant potential energy will clearly be of 
great importance in calculating such quantities as cohesive energy and work 
function (the energy required to remove an electron from the metal). In 
many calculations, however, only the distribution of energy levels relative 
to the lowest level is important and not their positions relative to some 
external zero. In such cases we may arbitrarily take the potential inside 
the metal to be zero, and as a slight further approximation the potential 



(b) 

Fig. 7.2. Potential energy of an elec¬ 
tron in a metal: (a) along a line of 
nuclei, (b) along a parallel line. 
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outside the metal may be taken to be infinite. We shall adopt a more sophis¬ 
ticated viewpoint in § 7.4, but meanwhile the problem is simply that of 
virtually non-interacting particles in a potential box of the type we have 
considered in § 2.3. Assuming that the metal is in the form of a cube of 
side L, we have immediately that the normalized wave functions of the 
stationary states (neglecting spin) are 



(7.1) 


within the metal, and zero outside, and the energy levels are 



(7.2) 


where n x , n y , and n 2 are positive integers. 

According to the Pauli principle not more than two electrons may occupy 
any orbital state, so that, at the absolute zero of temperature, two electrons 
will go into the ground state, two into each state of next higher energy, 
and so on, until all the electrons are allocated to states of lowest possible 
energy. Since the number of electrons is very large, it is thus understandable 
that, even at the absolute zero of temperature, some electrons have kinetic 
energies of several electron volts. For a piece of metal of macroscopic 
dimensions, say a centimetre cube, the energy of the ground state 
(n x = n y = n z = 1) is of the order of 10“ 15 eV and hence may be taken 
to be zero for all practical purposes. Also the maximum spacing between 
consecutive energy levels is less than 10“ 6 eV, so that the distribution of 
energy may be regarded as almost continuous — it is sometimes described 
as ‘quasi-continuous’. In particular, it is possible, and useful, to define a 
density of states in energy ^V(s) in the following way: 

the number of orbital states with energies lying between e and e + de is 
A''(e)de. 

Here it is assumed that d s is very small but still large enough for the interval 
to contain very many states — it is easy to see that this is possible, for the 
above considerations show that, if ds is only 10 -3 eV, the interval will 
contain at least 1000 energy levels. It follows that the number of orbital 
states, v(s) say, with energies less than e is 
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so that 


A'(e) = dv/de. 


We shall use this in calculating jV(z). 

Let us consider a system of rectangular Cartesian coordinates whose 
axes represent n x , n y , n z . We note, first of 
all, that each orbital state is represented by 
a triad of positive integral values of n XJL n y , 
n z , and hence by a point in the positive octant 
of this ‘n-space’. Also, if we divide the octant 
up into unit cubic cells, every point represent¬ 
ing a state lies at a corner of a cell, so that 
unit volum e cont ains just one representative 
point. Now, the equation 


n, + n v + n. — 


2 mL 
h 2 n 2 



s, 


(7-3) 


Fig. 7.3. The positive octant of 
n-space. The dots represent 
states. 


from (7.2), is that of a sphere in n-space, 
the right hand side being the square of the 

radius. Thus, the number v(s) of states with energies less than e is just the 
number of representative points lying within the positive octant of this 
sphere. Since there is one representative point per unit volume, therefore, 
this number is equal to the volume of the octant, that is 


(7.4) 


where v = L 3 , the volume of the metal. Differentiation with respect to s 
gives the density of states 


1 4n j 

( 2mL2 e) 

i* = JLj 

f2m\ 

8 ~3 ‘ 

\h 2 n 2 J 

' 6n 2 ' 

in 2 / 


^ ■ i ( 


?)'•* 


(7.5) 


The graph of A ( s ) is the parabola shown in Figure 7.4. It should be noted 
that here A (e) is the density of orbital states for a metal of volume v — 
the density of states is sometimes defined with respect to states including 
spin, and for unit volume of metal, which would have the effect of replacing 
v by 2 in (7.5). 

The above treatment ignores the fact that the surface of the sphere cuts 
through some of the unit cubic cells, but this is reasonable, since, except 
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in the immediate neighbourhood of the origin, v(e) is very large. The 
approximation assumes, in fact, that v(e) is a continuous function of e — 
this is not strictly true, of course, for we can never have a non-integral 
number of states, but it is equivalent to the assumption we made in defining 
^( £ ). 

At the absolute zero of temperature, as mentioned before, all the states 
with energies up to some maximum, which we will denote by ( 0 , will 
be occupied by two electrons with opposite spins. If N is the total number 
of electrons, we must have 

N = 2J o l V W d £ = £ (7.6) 

This gives the highest occupied energy level as 



which has the value 3.2 eV for metallic sodium. 

We have assumed that the potential energy is zero, so that all the energy 



Fig. 7.4. Density of states in energy for a free-electron 
system. The shaded region represents the distribution of 
occupied states at the absolute zero of temperature: its 
area is $N. The broken curve shows the ‘tailing off’ 
of the top of the occupied region at a temperature of 
about 1000°K, assuming the electron density to be 
roughly that in metallic sodium. 


is kinetic, and Co is the maximum kinetic energy at the absolute zero of 
temperature. The total kinetic energy of the electrons is given by 

2 J> (£)6ds - £? - iNU - 
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The average kinetic energy per electron, called the Fermi energy t, which 
we will denote by E F , is, therefore, 


E f — 


Ko- 


(7.8) 


For a monovalent metal the atomic radius r s , as defined in § 6.1.1, is 
given by 



(7.9) 


For polyvalent metals the number of electrons is not equal to the number 
of atoms, so that for N, in the above equation, we must substitute N/Z, 
where Z is the valency. In order to generalize the results it would be con¬ 
venient to introduce another radius, r e say, which is the radius of a sphere 
whose volume is equal to the average volume per electron, so that for 
all metals we should have 


471 

3 


v 14ti , 

— =- r,. 

N Z 3 


(7.10) 


This gives r e = Z~*r s in general, and r e = r s for a monovalent metal. 
In order to conform to the more general usage, however, we shall, unless 
otherwise stated, restrict ourselves to monovalent metals and use r $ as 
given in equation (7.9). None the less, it should be understood that all 
the results can be generalized to polyvalent metals merely by substituting 
r e for r s in the energy formulae. Of course, we have said nothing yet about 
the applicability of the approximations we are using to metals of different 
valencies, and we shall see, in fact, that the methods of the present chapter 
are more suited to monovalent metals. 

Using equations (7.7), (7.8), and (7.9), we find 



'97l\* 1 



(7.11) 


1 No universal notation yet exists in the theory of metals. One-electron energies are 
often denoted by E, rather than e, and the maximum kinetic energy of an electron, which 
we have denoted by f 0 , is sometimes denoted by E F or £ 0 , and it is sometimes also referred 
to as the Fermi energy. In the following work, except in one or two instances, we shall 
use the letter e to denote an energy associated with a particular one-electron state, and 
the letter E to denote an average energy per electron of a system — appropriate sub¬ 
scripts will be used to denote the particular energy terms we are considering. 
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and 


3ft 2 /9 tt\M 
10m \ 4 / r 2 


(7.12) 


For all metals r s is of the order of an angstrom unit, from which it follows 
that £ 0 an< 3 F f have values of several electron volts. 


7.2.1. FERMI-DIRAC STATISTICS 

The foregoing work, in which the temperature has been assumed to be 
absolute zero, is the simplest application to metals of the so-called Fermi- 
Dirac statistics. At higher temperatures the distribution of occupied states 
does not end abruptly at £ 0 , as does the shaded region in Figure 7.4, but 
tails off smoothly, as shown by the broken curve in the same figure. At 
absolute temperature T the number of electrons with energies lying between 
£ and e + d£ is found to be 


2^(£)/(£)d£, 

where f(s), known as the Fermi-Dirac distribution function +, is given by 


/( £ ) = 


1 

e (‘-?>/#r +1 • 


(7.13) 


Here k is Boltzmann’s constant and £ is a function of T which is determined 
by the equation 


2 f -A r (e)f(e)de = N, 
J 0 


(7.14) 


expressing the fact that the total number of electrons is N. 

We will now show that £(0), the value of £ at 0°K, is just what we have 
denoted by £ 0 . It is easy to see that, with £(0) positive, 

lim Q( e ~0/fi T = 0, if e < £(0), 

T-0 


and 


e (£ 0/A T -> oo as T -* 0, if e > £(0). 


* See, for example, M. Born, Atomic Physics, 6th ed. (Blackie, 1957), Chapter VIII, 
for a simple derivation. 
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At the absolute zero of temperature, therefore, the function/(e) becomes 



e < C(0), 
e > C(0), 


(7.15) 


so that equation (7.14) reduces to 

rao) 

2 JT{£) de = N. 

J o 

Comparing this with equation (7.6), we find that £(0) = £ 0 , as required, 
and the density of occupied orbital states with /(e) given by (7.15), 

is precisely what we found previously. At higher temperatures £ is not 
equal to £ 0 but, certainly at temperatures up to the melting points of most 
metals, it is very nearly so. 

The general features of the function /(e) for any temperature T are easily 
obtained from equation (7.13). The value of/(£) is always 0.5. For values 
of e much smaller than £, /(e) is nearly unity, and for values of e much 
greater than £, /(e) is very nearly zero — indeed, at normal temperatures, 
/(e) differs appreciably from the function (7.15) only in a small region 
about £. In order to estimate the width of this region, let us evaluate /(e) 
when e = £ — 2 kT and when e = £ + 2 kT. 

M-2AT) = —'— = 0.88, 
e 2 + l 


M + 2kT) = —= 0.12. 
e +1 


Thus, for practical purposes, we may assume that /(e) decreases from 
unity to zero in a region of width about 4 kT about £. Now, at room tem¬ 
perature, 300°K, kT is equal to an energy of 0.026 eV, whereas we have 
seen that £ 0 , and hence £, has a value of several electron volts. It follows 
that, at normal temperatures, 

&T<£. 

This means that the fall in value of f(e) near the energy £, or £ 0 approximately, 
is so rapid that the distribution of electrons in energy is very little dif¬ 
ferent from that at 0°K. The function f{e) at 0°K and at 1000°K is shown 
in Figure 7.5, the value of £ 0 having been chosen for illustration to be 
3.2 eV, as for sodium. 
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In the remainder of the book we shall make 
Dirac distribution function. Apart from a 
simple treatment of the electronic specific 
heat, in which we shall use the general 
results obtained above, we shall assume 
either that the temperature is absolute zero 
or that, at any temperature we may be 
considering, the distribution function does 
not differ appreciably from that at absolute 


no explicit use of the Fermi- 


m 

1.0 

0.5 

0 


_L 

2 


£(eV) 





_L 

4 


zero. 


7.3. Periodic Boundary Conditions and 
Running Waves 


Fig. 7.5. The Fermi-Dirac distri¬ 
bution function f(e). The full 
curve is at 0°K and the broken 
curve at about 1000°K, as¬ 
suming the electron density to 
be that in metallic sodium. 


(i) One-dimensional metal. For simplicity, 

let us first consider a one-dimensional metal, which may be visualized as 
a thin wire of length L. We take the Y-axis along the wire, with the origin 
at one end, and neglect all motion except that in the x-direction. If we 
apply the Sommerfeld theory to this case, that is, take the potential energy 
to be zero within the metal and infinite outside, we simply have the problem 
of a particle in a one-dimensional box, discussed in § 1.5. The energy 
levels are 


= 


h 2 n 2 
2 mil 


n = 1, 2, 3,. .., 


(7.16) 


and the normalized wave functions are 


<i>. = 



(7.17) 


We restrict the values of n to positive integers, because changing the sign 
of n does not give a different wave function — all wave functions are ar¬ 
bitrary to the extent of a multiplying factor of modulus unity. 

The wave functions ip n represent standing waves — they describe the 
states of motion of an electron which is bouncing to and fro between the 
walls of the box. This can readily be understood from the fact that the 
average value of the momentum is zero for all ip n . According to 

§ 2.9, we have 
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<P*> = 



i dx 


dx 


2wrfc f L . mix mix 

—-— sin-cos —- 

ili Jo L L 


This does not mean that the electron is stationary, but that a measurement 
of its momentum at any instant is equally likely to result in a positive or a 
negative value. 

Such standing waves, although undoubtedly correct for the system we 
have described, are of no help in problems of conductivity, for which we 
require running waves, or functions which describe the motion of an electron 
in a definite direction. A function of this type is 

$ k = e ik *. (7.18) 


We have seen in § 2.8 that this function represents a free particle travelling 
in the x-direction with constant momentum hk. It also satisfies the correct 
Schrodinger equation inside the metal, namely, 


dV 

dx 2 


2m . . 

+ -y e>A = °, 
h 2 


provided that 


h 2 k 2 

e = -. 

2m 


However, it is clear that \J/ k does not satisfy the correct boundary conditions, 
for it does not vanish at x = 0 and at x = L. None the less, in the one¬ 
dimensional case, it is very easy to devise a physical situation for which 
the appropriate boundary conditions will permit t j/ k as a solution. In order 
that a continuous electric current might flow, we must have a closed circuit. 
Let us, therefore, simply bend our wire round until the two ends are touch¬ 
ing, but still measure x along the wire. The Schrodinger equation inside 
the metal is then unchanged, but the infinite potential walls have been 
eliminated, and with them the necessity for the vanishing of the wave function 
at x = 0 and at x = L. The requirement that a wave function be smooth 
and continuous now imposes the boundary conditions 

HO) = HL), f(0) = «A'(L). (7.19) 

Since the point (x+L) is now the same as the point x, it follows, in fact, 
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that i p must be a periodic function of x with period L, that is, 

if/(x+L) = iftx), (7.20) 

for all x — this equation includes both of the conditions (7.19). These new 
boundary conditions, either (7.19) or (7.20), are called periodic boundary 
conditions. 

Applying these conditions to the function i{/ k , we find 

1 = e' kL , 

so that, 

k = -, n = 0, +1, +2,. ... 

L 


Notice that we now permit both positive and negative values of n, because 
e' kx and e~' kx represent different (but degenerate) states — they describe 
electrons travelling in opposite directions. The energy is 


£ k = 


tse 

2m 


2 mil 


(2 »f. 


(7.21) 


Comparing (7.21) with (7.16) we see that the energy levels e k include 
only one half of the energy levels e n — they also include zero, but this is 
of no significance. None the less, the density of states is exactly the same 
in the two cases, because to each level e k , except zero, correspond two 
orbital states, with positive and negative values of n respectively. We have 
seen, in our work on the Sommerfeld model, that for a metallic system 
the number of electrons is so large, and the spacing between levels is so 
small, that the only quantity of real importance is the density of states, 
and the individual energy levels only matter in so far as they contribute 
to this. We may therefore conclude that the use of periodic boundary con¬ 
ditions and running waves leads, as far as the bulk properties of metals are 
concerned, to the same results as are obtained with standing waves — 
running waves are more useful, however, in that they are applicable to 
conduction problems, and they are mathematically more convenient, so 
that we shall use them exclusively from now on. 

(ii) Three-dimensional metal. In applying the Sommerfeld theory to a cube 
of metal of side L, we assumed that the wave functions vanished on the 
surface of the cube, which led to the standing wave type of functions (7.1). 
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As explained above, it is generally more convenient, and in conduction 
problems necessary, to use functions representing running waves, which 
in the present case are of the type 

il/ k (r) = e ik ' r = e ! < k ** +k * + * 3 ‘>, (7.22) 

describing the motion of a particle with constant momentum hk. These 
functions satisfy the Schrodinger equation in the metal but not the original 
boundary condition — instead, they are required to be triply periodic in 
x, y, and z, with period L, that is, 

il/ k (x + L, y, z) = iA fc (x, y + L,z) — t j/ k (x, y, z + L) = ij/ k (x, y, z). (7.23) 

Equivalently, we may require that i j/ k and its gradient have the same values 
at corresponding points on opposite faces of the cube. We shall show in 
the following section, if it is not obvious from the work which has gone 
before, that these periodic boundary conditions lead to exactly the same 
density of states as we found previously — equation (7.5). 

Notice that, in the present case, there is no question of justifying these 
conditions physically in the same way as for a one-dimensional metal — 
it is impossible to bend a cube so that all its pairs of opposite faces come 
into contact with each other. However, another physical justification is 
often given. We have already remarked that the bulk properties of a metal 
cannot depend appreciably upon surface effects, so that, if we are not 
particularly interested in the latter, there is no harm in imagining our cube 
of side L embedded in a very large, theoretically infinite, block of metal. 
If we also imagine that this block is constructed of cubes of side L, then we 
see that the periodic boundary conditions follow from the very reasonable 
assumption that all these cubes have exactly the same properties — this is 
known as a cyclic system. It is important to remember, however, that such 
devices are not strictly necessary — the real justification of the periodic 
boundary conditions is that they lead to the correct density of states. 

Although we have so far considered only the Sommerfeld model of a 
metal, it is obvious that periodic boundary conditions will be even more 
appropriate when a periodic field is present, but we will defer this problem 
until the next chapter. It also need hardly be said that it is not essential 
to apply the boundary conditions to a cube — in the case of a rectangular 
parallelepiped of sides L l , L 2 , L 3 , (7.23) would become 

l l / k (x + L 1 , y, z) = *l/ k (xl y + L 2 , z) = 1 J/ k (x, y, z + L 3 ) = i{/ k (x, y, z), (7.24) 

and again the same density of states would result, with v = L i L 2 L 2l . 
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7.4. The Hartree Method Applied to a Free-Electron Gas 

In the Sommerfeld theory the electrons are treated as completely free, 
that is to say, as non-interacting particles moving in zero external field. 
The term free-electron gas is generally applied to a system of electrons 
moving in zero external field even when the interactions of the electrons 
among themselves are explicitly taken into account. The effects of the 
electronic interactions, and Coulomb correlations in particular, upon the 
energy levels and total energy of a free-electron gas will occupy a good 
deal of our attention in the remainder of the book. This is mainly because 
the free-electron gas is the only model for which accurate results can be 
obtained with a reasonable amount of labour. Fortunately, it is not of 
purely academic interest, for we know that the electrons in several metals 

— the alkali metals, for example, sodium in particular, and also some 
polyvalent metals with small ion-cores, such as magnesium and aluminium 

— behave very much like free electrons. In the present section we will 
apply the Hartree theory to a free-electron gas and shall discover that the 
results are the same as those of the Sommerfeld theory — the Hartree 
model is a more sophisticated one, however, and we will take the opportunity 
of reframing our previous work in terms of the vector k, appearing in 
the function (7.22). 

If the electrons are assumed to interact, then, in order to stabilize the 
system — that is, in order to prevent the electron gas from dispersing — 
we must introduce a uniform distribution of positive charge just sufficient 
to neutralize the charge of the electrons. The approximation involved is 
thus simply one of replacing the actual charge distribution of the ion- 
cores by a uniform distribution of the same total charge. Let us then con¬ 
sider a cube of side L, containing N electrons and a uniform positive charge 
distribution of density eN/L 3 . When periodic boundary conditions are 
applied, with running waves, the average electronic charge distribution is 
also uniform, so that the total Hartree field is zero — this will be seen below 
to be a self-consistent solution of the problem. 

The Hartree equation is thus, simply, 

_ VV = (7.25) 

2m 

If e lt e 2 , e 3 denote unit vectors along three mutually orthogonal cube 
edges, which we may take as axes (see Appendix 1), then 
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k = k l e l +k 2 e 2 + k 3 e 3 , 
r = xe l +ye 2 +ze 3 , 


and 


k • r = k 1 x+k 2 y+k 3 z. 


It follows that 


V 2 e ik ’ r = —k 2 e ik ' r , 


and a solution of equation (7.25) is 


MO = 


where C is any constant, provided that 

h 2 k 2 h 2 

s = e(k) = fl ^ =^(k 2 1 + k 2 2 + kl). 

7m 7m 


(7.26) 


(7.27) 


Applying the periodic boundary conditions (7.23) to ij/ k , we fin d 

e'* lL = e'* 2L = Q ikiL = 1, 

so that 


= 


2nn 1 u _ lnn 2 u _ 2nn 3 


k, = 


k, = 


(7.28) 


L L L 

where rt t , n 2 , n 3 may be positive or negative integers or zero. Hence, 


. 2n . 

k = — ( n 1 e l +n 2 e 2 + n 3 e 3 ). 


(7.29) 


The vector k is known as the wave vector of the state whose wave function 
is \}/ k , which is often referred to simply as the ‘state k\ The magnitude k is 
equal to 2n/X, where A is the wavelength of the wave represented by i fi k 
— for example, in the case of motion in the x-direction, it is easily seen 
that the function e'* 1 * is periodic in x with smallest period 2n/ki, since 

_ ^iJci(x + 2njki) 

In order to normalize the wave functions, we write 

1 = J hZ-Ipdr = C 2 Jdr = C 2 L 3 , 
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the integrals being taken throughout the cube. The wave functions 



(7.30) 


VL 3 


are therefore normalized. We note that |i/^| 2 = 1/L 3 , so that the average 
charge distribution is uniform, and the average charge density of all the 
electrons is — eN/L 3 — the effect of this is exactly cancelled by the assumed 
uniform positive charge distribution, giving the self-consistent result that 
the Hartree field is zero. 

The energy levels, from equations (7.27) and (7.28), are given by 



Comparing this with (7.2), it may be seen that only one in eight of the 
original levels are now present — those corresponding to even integral 
values of n x , n y , n z . However, owing to the fact that positive and negative 
values of n l9 n 2) n 3 now represent different states, the degeneracy of each 
level is eight times as great as before — in other words, there are now 
eight times as many states to each energy level. The density of states jV'lf) 
therefore remains unchanged. We will now verify this by another method. 

Instead of the ‘w-space’ which we introduced in § 7.2, it will be convenient 
to define a ‘/c-space’, whose rectangular Cartesian axes represent k lf k 2 , 
and k 3 . Equations (7.28) tell us that points in k-space with coordinates 


( 



where n 2 , n 2 , and n 3 are integers, represent orbital states, and a cube of 
side 2n/L therefore contains one orbital state. It follows that the number 
of states per unit volume of fc-space is L 3 /Sn 3 , or v/Sn 3 . We may therefore 
state that, 

for a metal of volume v, the number of orbital states lying within a volume 
element d k of k-space is vdk/Sn 3 . 

We assume, of course, that, although d k may be chosen to be very small 
compared with the occupied volume of k- space, it will none the less contain 
a large number of states — the approximations involved in the use of this 
formula are, in fact, the same as those we made in our previous calculation 
of J^Ce). 

Equation (7.27) shows that the surfaces of constant energy in k- space 
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are spheres, centred at the origin. At the absolute zero of temperature, 
the occupied region of k-space is thus a sphere, called the Fermi sphere, 
whose radius we will denote by k 0 . Since each orbital state is occupied 
by two electrons, twice the number of states within the Fermi sphere must 
equal N, that is 


2 


V 4n 

8tt 3 3 



and 


We now find 


k 0 = 


Co = 


h 2 kj = hl /9 
2m 2m \ 4 / r 2 ’ 


(7.31) 


in agreement with equation (7.11). 

Before finding the density of states for the present model, it will prove 
useful for our future work to obtain a general expression for jV (e) which 
applies to all cases in which the surfaces of constant energy in fc-space 
are spheres, centred at the origin, whether e is given by equation (7.27) 
or not. In other words, let us assume nothing more than that e is a function 
of the magnitude k of the wave vector only, that is, e = e(k). Then the 
number of orbital states with energies .less than e is 


3 8 n 3 6n 2 


The density of states in energy is the derivative of this with respect to s, 
so that 




de ' 

\6n 2 

!| 

fjL 

d k 

\6^ 2 

vk 2 

j de 

2n 2 / 

' d k 


de 
d k 


(7.32) 
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We shall have occasion to use this formula later. In the present case of 
the Hartree method, we have 


and 



de 
d k 


h l 

m 


k, 


as in equation (7.5). 


•^00 - 


v m 
2rt 2 h 2 


k 






7.5, The Hartree-Fock Method Applied to a Free-Electron Gas 


The Hartree-Fock equations for a monovalent metal were found in 
§ 6.2, equation (6.22), to be of the form 


1,2 


Tv ’ 


-i-vf+I W ,)+« 2 

2/71 a 


V C \U'2)\ 2 

j J r 12 



i) 


N 

~e 2 Z 

j 

(spin j 
= spin i ) 

We now wish to solve this equation for the free-electron system described 
in the previous section. We shall show that the normalized functions 


rizM'i) 


d r-, 


<A>Oi) = i)- (7-33) 


Wr) = -kc , ‘-' > (7.34) 

■Jo 

which we found to be eigenfunctions of the Hartree equation, are also 
eigenfunctions of equation (7.33) — corresponding to different eigenvalues, 
however. Anticipating this fact, we may omit the second and third terms 
in the first square brackets, since these represent the Hartree field, which 
we know to be zero for the present model. Also, from now on, we will 
use the subscripts k and k' to distinguish between eigenfunctions, rather 
than the subscripts i and j used in equation (7.33). The Hartree-Fock 
equation for the state k thus becomes 
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2m 


v iWfi)-'T J 



= e(k)Mr,)- ( 7 - 35 ) 


The sum over k' is a sum over all the occupied orbital states, so that, at 
the absolute zero of temperature, when each of these states contains two 
electrons with opposite spins, the sum over k' is the same as a sum over all 
electrons with one kind of spin, and a sum over all electrons with the 
other kind of spin would be precisely the same. The specification 
“spin j = spin i”, or “spin k' = spin fc”, thus need not be stated explicitly. 
The integrals are, of course, taken throughout the volume of the metal. 

Now, if we use the functions i j/ k given in equation (7.34), the exchange 
term, which is the second term on the left-hand side of equation (7.35), 
becomes 



(7.36) 


The integral may be written 



(7.37) 


and in order to evaluate it we make use of a device suggested by electrostatics. 
Suppose that <})(r ) is the potential at the point r due to a distribution of 
charge whose density is given by the function e ,( *~*' )-r . This is not a phy¬ 
sically real distribution, of course, but may be treated as such for purposes 
of calculation. Then <f>(r) must satisfy Poisson’s equation 

V 2 <f>(r ) = -47re i(fc ' fc),r , 

the required solution of which, as may be verified by substitution, is 

(7 - 39 > 

The potential at the point r u which is 


A 4 . 

must, according to (7.39), have the value 



(7.40) 
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4nt Kk ~ k ' yri 

and the expression (7.37) must therefore be equal to + 

4n 

\i^-W 


The exchange term (7.36) may thus be written 


where 


£ xO)<A k Oi), 


£ x(*0 = 


4ne 2 y 1 
v w\k-k'\ 2 


(7.41) 


We will call £ x (k) exchange energy associated with the state k. This 

proves that the functions i j/ k , as given in (7.34), are 
eigenfunctions of the Hartree-Fock equation, and the 
eigenvalues are given by 

£ (k) = t-lL + e x (k). (7.42) 

2m 

In order to evaluate c x (k) we make use of the fact 
that a volume element dk of k-space contains vdk/Sn 3 
orbital states, and replace the sum in (7.41) by an 
integral, so that 



4ne 2 v f dk' 

v 8ti 3 J |k —k'| 2 


The integral is taken throughout the Fermi sphere. If we use spherical 
polar coordinates (k\ 9, 4>), where 6 is the angle between k' and k (see 
Figure 7.6), we find 


t This result is not strictly accurate, since the value of the integral in (7.36) must depend 
to some extent upon the position of r x — the integral is taken throughout the cube of 
side L , and its value when r x is very close to a face of the cube must be different from 
its value when r x is in the centre. The approximations involved in obtaining this result, 
independent of r u however, are simply those which are implicit in the use of periodic 
boundary conditions. 
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» 2 ti ;,/2 


£x(*) = 


k' 2 sin 0 d 4> 


-—f °d k; fde r 

2u J 0 Jo Jo I* —*'l 

fc ' 2d ( cose ) 

71 J Q J _ 1 

e 2 C k ° I 

-f- k’ log 

71rC q | 


k 2 + k’ 2 — 2kk' cos 6 
k+k' 


k-k f 


(7.43) 


We note that the integrand becomes infinite when k' = k, so that, for 
k < k 0 , the integral must be expressed as the sum of the integral from 
0 to k and the integral from k to k 0 , and the convergence of both integrals 
must be examined. In fact, both integrals converge, and (7.43) may be 
evaluated using elementary integration by parts. The result, which is true 
for both k < k 0 and k > k 0 , is 


£x(*0 



k 2 0 -k 2 

kk 0 


log 


k 0 + k 

— 

1 

O 

_ 


(7.44) 


The Hartree-Fock energy associated with state k, referred to the same 
zero as in our application of the Hartree theory, is thus 


s(k) = 


2m 




k 0 + k 
k 0 -k 


(7.45) 


This is a spherically symmetrical function, that is, e(k) = e(k), as in the 
Hartree case. 

It may be verified that 


and 


so that 


k 2 -k 2 

lim K _±JL jog 
k^o kk 0 


k 2 -k 2 

lim — -log 

k^k 0 kk 0 


k 0 + k 

k 0 -k 

k 0 + k 
k 0 — k 


= 2 , 


= 0, 


S (0) = - ^ and e ( ko) = _ Ai 

n 2m n 


The band width for any metallic system may be defined as the difference 




























Ch. 7, § 5] HARTREE-FOCK METHOD APPLIED TO FREE ELECTRONS 173 


between the highest and the lowest one-electron energies at 0°K. The 
Hartree-Fock band width for a free-electron gas is, therefore, 

h 2 k 2 p 2 k 

e(fc„)-£(0) = ^ + —« 

2m n 

y £ kri 

= (o+ — 9 - (7.46) 

n 


The band width according to the Hartree theory is, of course, just £ 0 . 
The additional band width, e 2 k 0 /n, in the case of the Hartree-Fock theory, 
is considerable — it amounts to 4.1 eV for sodium, while Co is only 3.2 eV 
for this metal. The true band width, which, as we shall see, can be roughly 
estimated from soft X-ray emission data, favours the Hartree rather than 
the Hartree-Fock theory. 

The band width is the width of the occupied part of the distribution of 
states in energy, so it is perhaps not surprising that the shape of the^V(e) 
curve is quite different in the two theories. Since the surfaces of constant 
energy in k-space are again spheres, we may use formula (7.32) in the Hartree- 
Fock case also, giving 


^(s) = 


vk 2 j de 
In 2 ! d k 



~h 2 k e 2 

l 2k o _ 


log! 

)] 

_ m 2n 

\ k 

\ k 2 / 

1 k„-k 

JJ 


(7.47) 


The denominator, de/d k, becomes infinite when k = k 0 , so that ^(e) 
falls to zero at the surface of the Fermi sphere. 

Since we cannot obtain k as a simple function of e, the^f / '(e) curve must 
be found numerically by calculating both ^V(e) and e for chosen values 
of k — the result, for the electron density occurring in metallic sodium, 
is compared with the Hartree curve in Figure 7.7. There is one important 
distinction between the two curves which is often overlooked. The parabolic 
Hartree curve is the same for all temperatures and for all values of 
N — only the function /(e), which is the density of occupied states, 
changes. This is not so in the Hartree-Fock case — here the function 
jV' (e) itself depends upon the occupation of states, since the exchange energy 
e x (k) does. We have assumed the temperature to be 0°K: at higher tem¬ 
peratures, electrons would not simply be excited to states in the unoccupied 
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part of the jV{e) curve, but the shape of the curve would be altered, owing 
to the change in e x (k). Similarly, a larger value of N would not result in a 
filling up of unoccupied states lying beyond the sharp fall to zero in the 
^V(e) curve, but would result in a displacement of this zero — at 0°K the 
zero always occurs at the energy of the highest occupied state. It follows 
that the unshaded portion of the Hartree-Fock curve in Figure 7.7 



€(eV) 


Fig. 7.7. Density of states c/K(£) for a free-electron system with the 
same electron density as metallic sodium: (a) according to the 
Hartree-Fock theory at 0°K, (b) according to the Hartree 
theory, both referred to the same zero of energy. The shaded 
regions are occupied at 0°K. 

has no meaning in a rigorous sense — however, it does show to a good 
approximation the density of possible states of an electron shot into the 
metal from outside, since the presence of one additional electron cannot 
appreciably affect the jV(e) curve. Similarly, we expect the curve to have 
roughly the same shape at temperatures not too much above 0°K, even 
though the sharp zero no longer appears. 

Many physical properties of metals depend upon the shape of the ^(e) 
curve, particularly near the highest occupied energy level. Such properties 
are the electronic specific heat, soft X-ray emission spectrum, and spin 
paramagnetic susceptibility, all of which we shall consider in some detail 
later. It is clear that, as far as these properties are concerned, the Hartree 
and Hartree-Fock methods will give very different results — and those 
of the Hartree method are found to be in better agreement with experiment. 
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We know that this superiority of the Hartree method must be purely for¬ 
tuitous, however, for this method completely ignores Coulomb correlations. 
The Hartree-Fock method, on the other hand, as we have seen in § 6.3, 
takes some account of parallel spin correlations, while neglecting anti¬ 
parallel spin correlations. It would seem that, as far as the density of states 
is concerned, it is better to ignore all correlations than to include only some 
of them. None the less, the Hartree-Fock method gives a better total energy 
than the Hartree method. This will be proved below, but it is suggested by 
Figure 7.7 — the Hartree-Fock ^(e) curve is displaced considerably to 
the left of the Hartree curve, so that, in spite of the larger band width, 
the Hartree-Fock total energy is expected to be much lower than that of 
the Hartree method. We shall show in Chapter 9 that this lowering of 
the energy is very necessary, for the Hartree method itself gives a negative 
cohesive energy, that is to say, no cohesion at all, whereas the Hartree- 
Fock method does give a positive cohesive energy, although it is still much 
too small. It will be seen later, in Chapter 10, that, when proper account 
is taken of all Coulomb correlations, the density of states becomes very 
similar to that of the Hartree theory and the total energy is lowered 
still further, giving excellent agreement with the observed cohesive 
energies. 

We will now calculate the total energy of a free-electron gas according 
to the Hartree-Fock theory. This is not obtained simply by summing the 
eigenvalues e(k) over all the electrons, which would, in fact, include the 
total exchange energy twice, but the correct expression may be deduced 
from either equation (6.18) or equation (6.24). Since there are no ion- 
cores in the free-electron model, but only a uniform distribution of positive 
charge which exactly cancels the average distribution of electronic charge, 
the second and fourth terms on the right of equation (6.24) disappear, 
and the total energy becomes 


Z e i + i Z Z [ f~ l l y i( r i) l l'*j( r 2) l l / i( r 2) l l / j( r i)dr i dr2, 

i i j J J V 1 2 


(II spins) 


in our earlier notation. We will denote this by NE HF , where E m is the 
average Hartree-Fock energy per electron, and also translate it into the 
‘k-notation’. Remembering that a sum over k is a sum over orbital states, 
so that it must be multiplied by two to obtain a sum over electrons, 
we find 
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NEup = 2 X e(k)+ Z Z f f — dr 2 

k u k'J J r 12 
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(7.48) 


The first term is the total kinetic energy, or NE F , where E ¥ is the Fermi 
energy given by equation (7.12) — this is, in fact, the total energy in the 
Hartree approximation. The second term is the total exchange energy, 
which may be evaluated by replacing the sum by an integral over the 
Fermi sphere, as we did in evaluating £ x (fc). This gives 



l e2 Ji (±V i 

4 \47r 2 / r s 


(7.49) 


using (7.31). 

Denoting the average exchange energy per electron by E x , then, we have 


E 


x 


—(jlv 1 

4 \47r 2 / r s 


(7.50) 


The average energy per electron in the Hartree-Fock method applied to 
free electrons is, therefore, 


Ehf — E f + E x 

= - l L 1 _ 3e^ 1 

10m l 4 / r 2 4 \47i 2 / r s ’ 


(7.51) 
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It is convenient to use the ‘atomic units’ defined in § 3.3, that is, to express 
energies in rydbergs and lengths in Bohr units. In this case, we find 


and 


- 


3b 1 


10 m 
(9n\* 1 


/9 1 /WV/2 b 2 \ 

\ 4 / rl \ h 2 / \meV 


= 3 1 
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ryd, 
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4 \47i 2 / r s l h 2 / Imev 
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2 \4ti 2 / r. 


0.916 
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(7.52) 


(7.53) 


The average energy per electron in the Hartree-Fock method is thus 


^ 2.21 0.916 

£hf = —r-ryd, 

r s r s 


(7.54) 


while in the Hartree method it is simply the Fermi energy E F . For all 
the monovalent metals r s lies between about 3 and 6 B.u., so that the 
exchange energy lowers the value of F'hf considerably — making it negative, 
in fact. 


7.6. The Exchange Charge Density and the Fermi Hole in a Free-Electron gas 


We will first calculate the exchange charge density in the free-electron 
gas discussed above. The general expression for the exchange charge 
density is given by (6.25). Translating this into our ‘fc-notation’, we find 
that the exchange charge density at position r 2 for an electron in state k 
at r t is 


g y > l / k’( r 2 ) l A fc( r 2 ) t / / fc ( r i) 
*■ <Afc0i) 


(7.55) 


Using (7.34) and, as usual, replacing the sum in (7.55) by an integral 
throughout the Fermi sphere, this expression becomes 


e f e i(k'-fc)-(ri-r 2 ) dfc 7 _ J_ e -ifc- (n-n) j gift'■ ('•>~ r 2) c y c , > 

8n 3 J 8n 3 J 


(7.56) 
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In order to evaluate the integral we use spherical polar coordinates 
(k\ 9, (j)), where 9 is the angle between the vectors r 1 — r 2 and k\ The 
exchange charge density is then 


— e" ifc • (ri_r2) ( k °2nk' 2 dk' f e iit>I2COS0 d (cos 9) 
8ft 3 Jo j -1 


_^ e _ ik . (ri _ r2) r*° k! sin fc'r 12 
27 1 2 J 0 r l2 


dk' 


sin k 0 r 12 — k 0 r 12 cos k 0 r 12 
1 12 

The magnitude of the exchange charge density thus decreases rapidly as 
r 12 increases, and it is easily verified that, in the limit as r 12 ^ 0, its value 
becomes eN/2v, which is equal in magnitude, but opposite in sign, to 
the average charge density of electrons with one kind of spin. We see, 
however, that (7.57) is not spherically symmetrical about r lf that it depends 
upon k, and that it is a complex function. 

A more useful picture is obtained by averaging the exchange charge 
density over all electrons with one kind of spin, or over all values of k 
within the Fermi sphere. This also provides a more correct picture, as far 
as our interpretation is concerned, for, as we remarked in § 6.3, when a 
determinantal wave function is used, it is not correct to regard the electron 
at as being in a particular one-electron state, but it must be regarded 
as partially in all of them. Since there are N/2 electrons with one kind of 
spin, the average exchange charge density at r 2 for an electron at r t is 


• (n ~r 2 ) 


(7.57) 
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(7.58) 


£ = k 0 r 12 



where 
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Adding (7.58) to —eN/2v, we find that the average charge density, at 
distance t,jk 0 from an electron at r t , of electrons with parallel spins is 


eN 

- ^ m> 

2v 


where 


F K ) = i-9 ^-f cos < y 


(7.59) 


The graph of the function F (£) is shown in Figure 7.8. It is at once ap¬ 
parent that the description of the average Fermi hole, as a spherical hole of 
approximate radius r s in the distribution of electrons with parallel spins, 
is a very reasonable one. 



Fig. 7.8. Graph of the function F(|). At distance |//c 0 
from a given electron the average charge density of electrons 
with parallel spins is — eN F(^)/2v (if the electronic motions 
were uncorrelated this would be just —eN/2v). 

The basis of the foregoing work was the Hartree-Fock equation (7.35), 
and the treatment has been of a rather intuitive nature. It is instructive, 
and some may find it more satisfying, to obtain the same result directly, 
without explicit use of the Hartree-Fock equation, by showing that the 
probability of simultaneously finding two electrons with parallel spins in 
volume elements a distance r 12 apart is proportional to F(£). This may 
be derived, rather lengthily, from the probability density of the whole 
electron distribution, that is, the squared modulus of the total determinantal 
wave function, or more simply in the following way. 

Let us consider at first only two free electrons, with parallel spins, in 
the states k and k'. If we assume that the snins are positive, then the deter- 
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minantal wave function for this system becomes 


Vyjl 


_i k • r i 


Ak' • i*i 


k • T2 


a i k' • i*2 


a(CXQ. 


e" ' ‘ e 

which is normalized in the volume v. The probability that r 1 hes in the 
volume element dr l and r 2 in the volume element d r 2 is, thus, 
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(7.60) 


where stands for the real part of the function following it. 

The same expression holds for all pairs of states k and k’ associated 
with the same spin. In order to obtain the total probability, therefore, we 
average over all k and k' within the Fermi sphere. Hence, for our 77-electron 
system, the probability that electron 1 is in d r 2 and simultaneously electron 
2, with parallel spin, is in dr 2 is 7 
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(7.61) 


t We know that two electrons with the same spin cannot occupy the same orbital state. 
It is unnecessary to take this into account in the double summation, because the value 
of (7.60) is zero anyway when k = k\ However, in order to obtain the correct average, 
we should really divide by the number of different pairs of k and k\ namely, \N(\N— 1), 
instead of J7V 2 , as we have done, but N is so large that the difference between the two 
is negligible. 
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The integrals in the third line are equal in value and both real, as we have 
found above, so that the 0t, does not affect the result. 

The probability of finding a given electron in the volume element dr, 
regardless of the positions of the other electrons, is dr/p. It follows that, 
if electron 1 is known to be in dr l5 the probability of electron 2, with 
parallel spin, being in dr 2 is 



v 


and the average charge density of electron 2 at r 2 is, therefore, 


- £ m- 


Hence, the average charge density at r 2 due to all electrons with spins 
parallel to that of the electron at iq is 


eN 

2v 


no. 


as we found previously. 

If the motions of the electrons were completely uncorrelated, unity 
would be substituted for F(£) everywhere, so that it is the second term of 
F(£), equation (7.59), which represents the effects of correlations due to 
the Pauli principle. It should be noted that the Coulomb interaction of the 
electrons plays no part in the foregoing calculation — the probability (7.61) 
would be exactly the same if the electrons did not interact at all, provided 
only that the total wave function were of determinantal form-. However, 
if the electrons did not interact, the correlated motion represented by 
(7.61) would not affect the energy of the system — in other wo^ds the 
exchange energy would be zero. 


APPLICATIONS OF THE FREE-ELECTRON THEORY 

7.7. Electronic Specific Heat at Low Temperatures 

There are two distinct contributions to the specific heat of a metal: 
one from the thermal vibration of the ion-cores, which is called the lattice 
specific heat, and the other, called the electronic specific heat, from the 
thermal motion of the valence electrons. We only wish to consider the latter 
here. Fortunately, the two can be separated at low temperatures, because 
it is found that, at temperatures T near absolute zero, the lattice specific 
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heat is proportional to T 3 while the electronic specific heat is proportional 
to T. The total specific heat at low temperatures is, thus, 

C = yT+AT 3 , (7.62) 

where y and A are constants, so that, if CjT is plotted against T z , the result 
is a straight line which intersects the C/T axis in the value of y, giving the 
electronic specific heat. 

We will not obtain an explicit formula for the electronic specific heat, 
which involves the use of the Fermi-Dirac distribution function, but will 
be content to obtain the temperature dependence of it by general arguments. 
In particular, we will show that the Sommerfeld-Hartree method gives the 
correct temperature dependence, while the Hartree-Fock method does not. 
The actual values of the electronic specific heat, as given by the Sommerfeld- 
Hartree method are in reasonable agreement with the experimental values 
for the ‘free-electron’ metals. 

At very low temperatures, as equation (7.62) shows, the electronic specific 
heat is larger than the lattice specific heat, but at normal temperatures the 
electronic specific heat is negligibly small compared with the other. This 
state of affairs could not be explained until quantum mechanics was applied 
to the problem. According to classical statistical mechanics, in a gas of 
free particles at temperature T °K each particle will have on the' average 
an energy -j kT Since, in order to explain the conductivities of metals, it 
was necessary to assume that the number of conduction electrons was of 
the same order as the number of atoms, this meant that, according to 
the classical theory, the electronic and lattice specific heats should have 
similar magnitudes. That this is not so is easily understood from our work 
on the free-electron gas. We have seen that at normal temperatures all the 
one-electron states of low energy are filled, and that the only electrons 
which may accept energies of the order of kT are those occupying states 
near the Fermi surface. Thus, although the excited electrons have a normal 
specific heat, they are so few in number that their contribution is negligible 
compared with the lattice specific heat, which at ordinary temperatures is 
about 3 k per atom. 

(i) Sommerfeld-Hartree theory. According to this theory of a free-electron 
gas the graph of the density of states function ^(e), as shown in Figure 7 7 
is a parabola, and, at low temperatures, ^( £ ) does not change much from 
^k(Co) m an energy range of order kT about C 0 . The number of electrons 
which can accept thermal energy of order kT is therefore proportional 
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to JT (Co) kT, and the increase in thermal energy when the temperature is 
raised from 0°K to T° K is proportional to ^V(£ 0 ) (kT) 2 . Differentiating 
this with respect to T, we find, for the low temperature electronic specific 
heat C v at constant volume, 

C, oc T, (7.63) 

in agreement with experiment. The same result will always follow provided 
^V{e) does not vary rapidly near the Fermi surface. 


(ii) Hartree-Fock theory. In this case we do not expect to obtain the correct 
temperature dependence of C v , for we have seen that jV~( e) falls rapidly 
to zero at the Fermi surface and then rises again — we may assume that 
the jF( e ) curve, shown in Figure 7.7, is essentially the same at very low 
temperatures as at absolute zero. The electrons which can accept thermal 
energy are again those which lie within an energy range of about (iT from 
the Fermi surface, but this time we cannot obtain an estimate of their 
number simply by multiplying kT by the value ofyf / '(e) at the Fermi surface, 
since this is zero. In order to find the required number, we must first obtain 
the value of k for a state whose energy lies kT below the energy e(k 0 ) at 
the Fermi surface, T being very small — in other words, we must find k 
such that 

s(k 0 )-s(k) = kT. (7.64) 

Then, if we write 

k = (1 -x)k 0 , 


the number of electrons which can accept energy kT is the number lying 
within a spherical shell of thickness xk 0 at the Fermi surface, that is, 


2 . . 2 , kl 

— 3 4nk 0 xk 0 = ~z 

8 7i n 


x, 


(7.65) 


for unit volume of metal. The increase in thermal energy when the tem¬ 
perature is raised to T will be proportional to kT times this. 

From equation (7.45) we have 
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(7,66) 
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We may assume that x is small enough for its square to be neglected, and 
2/x is thus very much greater than unity, so that, as a good approximation, 

e(k) = (l — 2x)~ -J- (2k 0 + 2xk 0 log-). 

2m 2n \ xl 

Using this and (7.66) in (7.64), therefore, we obtain 

arp h 2 kl e 2 k 0 2 
HT = --xH- - x log - . 


m 


n 


As x approaches zero, the first term on the right becomes negligible com¬ 
pared with the second, and we may write 

kT = ^2* log-, 
n x 

approximately. The solution t of this equation is very nearly 

nkT /, f 2e% \ 


x = 


e k 0 > 


log 


■ 2 _i 
nkT J' 


(7.67) 


From (7.65) and (7.67), ignoring constant factors, we find that the 
increase in thermal energy on raising the temperature from absolute zero 
to a very small temperature T is proportional to 

„2 i 


log 


( 2e% \ 
l nkT / 


Differentiating with respect to T, we find, for the low temperature specific 
heat, 
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I This may be seen as follows: 
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When 6 is very small a good enough approximation is, clearly, 
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and the second term is negligible compared with the first. As a further 
simplification, we may write 



and neglect the first term on the right as T approaches zero. We then 
have, finally 


C„ oc I 1 /log T. 


(7.68) 


According to the Hartree-Fock theory, therefore, as T tends to zero, 
CJT also tends to zero, instead of to the finite value found experimentally 
and as given by the Sommerfeld-Hartree theory. We shall see, in Chapter 10, 
that the correct temperature dependence is restored when proper account 
is taken of Coulomb correlations. 

7.8. Spin Paramagnetism 

If a metal (or any other substance) is placed in a magnetic field H, it 
becomes magnetized, the magnetic moment per unit volume M being 
related to H by the equation 


(7.69) 


M = X H, 


where % is the magnetic susceptibility of the metal. Defined in this way, % 
is known as the volume susceptibility, but it should be noted that the sus¬ 
ceptibility per unit mass and the susceptibility per atom, defined in terms 
of the magnetic moment per unit mass and per atom respectively, are 
frequently used. 

If x is negative, the metal is diamagnetic, if X is small and positive, the 
metal is paramagnetic, and, if X is large and positive, the metal is ferro¬ 
magnetic. Most metals are weakly paramagnetic or diamagnetic — of the 
monovalent metals, for example, the alkali metals are paramagnetic, while 
the noble metals, copper, silver, and gold are diamagnetic. Excluding alloys, 
the only metals known to be ferromagnetic are iron, cobalt, nickel, and 
gadolinium, and we will omit these from consideration here. 

Diamagnetism is due to the orbital motion of the electrons, whereas 
paramagnetism is due to the electron spin. It may be shown that, except 
in the case of some transition metals, the contribution to the susceptibility 
from the ion-cores is diamagnetic, so that any paramagnetic susceptibility 
must be due to the conduction electrons. We will now derive an expression 
for the paramagnetic susceptibility of the conduction electrons according 
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to the Sommerfeld-Hartree theory of a free-electron gas. Although we 
will assume, for simplicity, that the temperature is absolute zero, it is not 
difficult to show, using the Fermi-Dirac distribution function, that the sus¬ 
ceptibility is practically the same at all normal temperatures. 

When there is no external magnetic field, all the orbital states of lowest 
energy are occupied by two electrons each, with opposite spins. We may 
thus draw two parabolic ^V(e) curves, occupied up to the same energy, 
as in Figure 7.9 (a), one being for electrons with positive spin and one for 
electrons with negative spin. We know that, when a magnetic field H is 
switched on, an electron whose (spin) magnetic moment is parallel to the 
field will have its energy decreased by the amount n B H, where ji B is the 
Bohr magneton, given by 

eh 

Hb — ~ > 

2 me 

and an electron with its magnetic moment antiparallel to the field will have 
its energy increased by the same amount. Thus, in the presence of a magnetic 
field ##, the density of states curves for the electrons with spins parallel 
and antiparallel to the field respectively will become displaced relative to 
one another by an energy 2 /i b H, as shown in Figure 7.9 (b). 

£ £ 




Fig. 7.9. Distribution at 0°K of free electrons with spin magnetic moments parallel and 
antiparallel to the direction of a magnetic field H. (a) before, and (b) after the field is 
switched on. e is the total one-electron energy, kinetic plus magnetic. 

The number of electrons in each distribution will not remain the same, 
however, for this would mean that the maximum energy of the electrons 
with magnetic moments parallel to the field would be greater than that 
of the others, and a lower energy could be obtained by transferring some 
of the electrons from the higher distribution to the lower. The lowest 
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energy is obtained when both distributions are filled up to the same energy 
(kinetic plus magnetic). 

Now, even for fields of several hundred thousand gauss, /u B H is only 
of the order of 10" 3 eV, so that in an energy range whose width is /u B H, 
at Co, we may take the density of states to be ^(( 0 )- The number of elec¬ 
trons which turn their magnetic moments parallel to the field is, thus, 

Z'bH/TXCo), 

which means that the number of electrons with magnetic moments parallel 
to H is how 


2/iaH'nCo) 

greater than the number with magnetic moments antiparallel to H. The 
magnetic moment per unit volume is therefore 


2^Hjr(c 0 )iv, 

parallel to ##, so that, from equation .(7.69), we find the paramagnetic 
susceptibility to be 


X = 2n 2 B J^(t 0 )lv. 

From equations (7.5) and (7.7), we have 

x vm /3tc 2 JV\* 

so that, if n = N/v is the electron density, 




n 2 h 


(7.70) 


(7.71) 


This is the paramagnetic susceptibility of a gas of free electrons, calculated 
according to the Hartree approximation. The values given by the formula 
cannot easily be compared with experiment, even for those metals, such 
as sodium, for which the valence electrons are very nearly free. This is 
because the total susceptibility, which is what is generally observed, consists 
of the sum of three terms — in addition to the spin paramagnetism of 
the valence electrons, there is also a diamagnetic contribution from the 
valence electrons, due to the effect of the magnetic field upon their orbital 
motion, and a diamagnetic contribution from the ion-cores. The latter 
can be estimated, however, on the assumption that the ion-cores are 
the same as in a free atom, and it can also be shown that, for perfectly 
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free electrons, the diamagnetic contribution is exactly — | of the para¬ 
magnetic contribution. The total susceptibility calculated in this way is 
of the right order of magnitude for some simple metals, but the corre¬ 
spondence between the calculated and experimental values is not close. It 
should be pointed out, however, that the experimental values themselves 
are not very consistent, probably owing to the effects of ferromagnetic 
impurities. 

If the Hartree-Fock method is used to calculate the paramagnetic sus¬ 
ceptibility, it is clear from the density of states curves shown in Figure 7.7 
that the value will be very different from that given by (7.70). The situation 
is complicated by the fact that a change in the number of electrons with 
either kind of spin will alter the exchange energy, which depends upon the 
numbers of electrons with parallel spins. Detailed calculation shows, in 
fact, that the inclusion of the exchange energy increases the calculated 
susceptibility considerably — in the case of sodium it multiplies the value 
given by (7.71) by a factor of about three, which is certainly too high. 
This, again, is due to the inclusion of parallel spin correlations while com¬ 
pletely ignoring antiparallel spin correlations — when Coulomb correlations 
are properly taken into account, as in the method described in Chapter 10, 
the resulting susceptibility is little greater than that given by the Hartree 
method, and in better agreement with experiment. 

We shall see in Chapter 9 that the effect of the periodic field of the ion- 
cores on the wave functions of the valence electrons can sometimes be 
taken into account by substituting for the electronic mass m an ‘effective 
mass’ m*. If this is done in formula (7.71), the result is little different in 
the case of sodium, for which mjm* is very nearly unity, but is in much 
better agreement with experiment in the case of lithium, for which m/m* 
is 0.69. 

7.9. Soft X-Ray Emission Spectra 

If a piece of metal is irradiated with cathode rays, some of the ions will 
lose electrons; that is, vacancies will occur in some of the inner shells, or 
bound levels, and valence electrons may fall into these vacancies with 
the consequent emission of X-rays. If e x is the energy of the valence electron 
before the transition, and e 2 the energy of the bound ievel, the frequency v 
of the emitted X-ray is given by 

—s 2 = hv. 
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The intensity of X-rays with frequency v will depend upon the density of 
valence electrons with energy gj, and, since there is a quasi-continuous 
distribution of valence electrons in energy, called the conduction band, 
the result is an X-ray spectrum of continuously varying intensity (see 
Figure 7.10). If everything else were equal, the intensity of the X-ray spectrum 
would exactly reflect the density of states jV( e) of the valence electrons. 




Conduction 

band 


N(8) 


Bound 

level 



Fig. 7.10. Schematic diagram showing 
the transition of a valence electron from 
the conduction band to a vacant bound 
level, together with the density of states. 


Fig. 7.11. Soft X-ray L z emission band 
for sodium (after Skinner, 1940). 1(e) is the 
intensity and v is the emitted frequency, 
the quantity I(e)/v 3 is comparable with^F(e). 
AB is the ‘empirical’ band width, 3.0 eV, 
and OB is the ‘reduced’ band width, 2.5 eV. 


Complications arise, however, owing to the fact Jhat the probability 
of a transition is not constant for all valence electron energies, but depends 
both upon the wave function of the bound level and that of the valence 
electron. None the less, in some cases the soft X-ray emission spectrum 
is expected to give a reasonable picture of the */C(e) curve, and, even when 
the shape of the curve is in doubt, at least the width of the band should 
be given with reasonable accuracy. The L 3 spectrum of sodium, which is 
due to transitions from the conduction band to bound 2p levels, is shown 
in Figure 7.11. 

It is difficult to fix the energy corresponding to the bottom of the band, 
owing to the presence of a long ‘tail’, which is not directly related to the 
jV'ip) curve. However, this may be done roughly by a linear extrapolation, 
or, what would appear more reasonable if the Sommerfeld-Hartree Jf (s) 
curve is a good approximation, a parabolic extrapolation of the main part 
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of the intensity curve. Skinner, from whose work t the figure was taken, 
called the band width resulting from the linear extrapolation the empirical 
band width, and that resulting from the parabolic extrapolation the reduced 
band width. The empirical and reduced band widths found by Skinner 
from the L 3 spectra of the metals sodium, magnesium, and aluminium are 
compared with the Sommerfeld-Hartree theoretical band widths in Table 1 , 
and the agreement is seen to be quite good. The calculated values are just 
the values of Co given by equation (7.11), remembering that the numbers 
of valence electrons per atom are one, two, and three respectively for 
sodium, magnesium and aluminium, and using the appropriate r t , given by 
equation (7.10), instead of r s . 


Table 1 


Band widths 

Na 

Mg 

Al 

Calculated (f 0 ) 

Empirical (Skinner) 
Reduced (Skinner) 

3.2 

3.0+0.2 

2.5±0.3 

7.3 

7.2±0.2 

6.2±0.3 

11.9 

13.2±0.4 

11.8±0.5 


All values are in eV. 


The Hartree-Fock band width, as given by equation (7.46), for these 
metals is about twice Co> so that, although the experimental band width 
is uncertain to the extent of an electron-volt or so, it is quite clear that the 
Hartree-Fock result is badly in error. Once again, it will be seen in Chapter 
10 that when proper account is taken of Coulomb correlations the band 
width returns to a value close to that of the Sommerfeld-Hartree theory. 

7.10. The Thomas-Fermi Approximation 

We will now describe a method, devised by Thomas and Fermi, which 
applies the ideas of the free-electron approximation to many-electron 
systems in which the potential energy of an electron is not constant. It 
has been used, with varying degrees of success, in calculating the electronic 
charge distributions in heavy atoms, molecules, and solids. 

We have seen that, for a free-electron gas of volume v, an element d k 
of fc-space contains v dfc/ 871 3 orbital states, and hence may accommodate 


t Skinner, H. W. B„ 1940, Phil. Trans. Roy. Soc. A, 239, 95. 
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2 y dk/8n electrons. Also, the momentum p of a free electron is given by 

p = hk. 
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Thus, if we consider a momentum space, or p-space, similar to k- space, 
we see immediately that a volume element dp may accommodate 
2v dp/8n 3 h 3 , or 2v dp/h 3 , electrons. Also, at the absolute zero of tem¬ 
perature, the occupied portion of p-space is a sphere of radius p 0 , where 
Po = . If the total number of electrons is N, therefore, we have 


and, if n — N/v is the number of electrons per unit volume, this gives 


8tc , 



The Thomas-Fermi approximation assumes that this relation remains true 
for the ground state of a system even when the electrons are not free, that 
is, when the potential energy is not constant. The electron density n and 
maximum momentum p 0 are now functions of the position vector r, however, 
so that we have 



(7.72) 


The approximation is clearly only valid if the electron density is so great 
that a volume element of real space in which the potential energy changes 
very little may still contain a large number of electrons. 

Now, let us suppose that —eV(r) is the potential energy of an electron 
due to the field of all the nuclei present and the whole of the electronic 
charge distribution. If we denote by -eV 0 the total energy, kinetic and 
potential, of the fastest electron, then the classical equation of energy 
tells us that 



or 


Po(r) = [2me(V- K 0 )]± 


and substitution in (7.72) gives 


Rtt 

n(r) = — [2 meiV-Vo)]*. 
3 h 


(7.73) 
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V 0 is, of course, independent of r, —eV 0 being simply the highest occupied 
energy level of the system. 

Now, for consistency, the potential F(r) must satisfy Poisson’s equation, 
that is, since the electronic charge density is —en(r), 


V 2 F = 4nen, 


or, using (7.73), 

V 2 V = q{V-V 0 j\ 

(7-74) 

where 

32n 2 e , 



" = 3*» (2me) ’• 



Equation (7.74) is the basic equation of the Thomas-Fermi theory and 
it must be solved, in general numerically, subject to appropriate boundary 
conditions. It should be noted that equations (7.73) and (7.74) are only 
valid so long as V > V 0 ; n(r) must be zero in any region where V < V 0 , 
that is, where the potential energy —eV is higher than the maximum energy 
— eV 0 of any electron. 

It is possible also to include the exchange energy in the theory, again 
using the free-electron formula, and various other generalizations have been 
made, but it would take us too far afield to consider these complications 
here. 

7.10.1. FIELD DUE TO A POINT CHARGE IN AN ELECTRON GAS 

As a simple application of the Thomas-Fermi theory let us find the field 
due to a point charge q introduced into a free-electron gas. Initially, before 
the introduction of q, the system consists of a uniform distribution of 
positive charge and an electron gas of uniform density, the total charge 
being zero. We may assume that the potential is then zero everywhere, 
so that 

~eV 0 = Co, (7.75) 

and the initial electron density, which we will here distinguish by the 
symbol n 0 , is 

n ° = 3h* ( 2m ( 7 * 76 ) 

If the system is a large one it is reasonable to neglect any change in 
-eV 0 when q is introduced. Thus, if the final potential is V(r), taking the 
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origin at the position of q, the final electron density is, from (7.73) and 
(7.75), 


( \ 871 
n(r) = —- 

3/i 3 


2 me I V+ 


e / J 



(7.77) 


Neglecting surface effects, for a sufficiently large system, we may take both 
V(r ) and n[r) to be spherically symmetrical about q. 

Since there is a uniform distribution of positive charge of density en 0 , 
Poisson’s equation now gives 


V 2 V= 4ne(n-n 0 ) 



(7.78) 


The effect of the introduction of q will be to cause a redistribution of the 
electron gas, so as to increase or decrease the electron density around q, 
according to whether q is a positive or a negative charge. The electron gas 
will, in fact, ‘screen’ the charge q so that the field at large distances is zero, 
but this screening will have negligible effect close to q. The boundary 
conditions on V(r ) are, therefore. 


V(r) —► 0 as r —► oo, 
V(r ) -► - as r -* 0. 


(7.79) 


Equation (7.78), as it stands, can only be solved numerically, but if we 
assume that V is so small that the square of eV/C 0 can be neglected in 
comparison with unity then an approximate analytical solution may be 
obtained. This assumption is certainly not valid very close to q, since 
V(r) -*■ oo as r -*■ 0, and, furthermore, equation (7.78) itself does not hold 
when eV/£ 0 < — 1. None the less, provided the assumption fails in only 
a very small region about q, we may hope to obtain at least a rough ap¬ 
proximation to V in this way. We thus expand (1 +eV/(, 0 ) i binomially and 
find, approximately, 
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Since V(r) is spherically symmetrical, equation (7.78) becomes 

d 2 V 2dV 6ne 2 n 0 .. 

dr 2 r dr Co 


(7.80) 


It may be verified by substitution that the solution of this equation which 
satisfies the boundary conditions (7.79) is 


V(r) = - e“ r/A , 
r 

where X is positive and given by 

-JU*. 

6ne 2 n 0 J 

The potential is thus a screened Coulomb potential and X is called the 
screening distance — it is the distance in which the potential is reduced to 
about one third of its unscreened value, and is a rough measure of the 
effective range of the field due to q . 

If we assume that the electron gas is that of the valence electrons in a 
monovalent metal, then, in terms of the atomic radius r s , we have 



(7.81) 

(7.82) 


and 


so that 



where a 0 — h 2 /me 2 is the Bohr radius. If r s is measured in Bohr units, 
this gives 

X = 0.64 rf B.u. (7.83) 

For sodium r s is about 4 B.u., so that 

X = 1.28 B.u. = 0.68 A, 

which is much smaller than the average inter-electronic distance. 

The calculation has application in estimating the potential due to an 
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impurity atom in a metal, which is used in the theory of alloys. If an im¬ 
purity atom of valency z +1 is dissolved in a monovalent metal, for instance, 
it may be regarded roughly as a point positive charge ze in a free-electron 
gas, and the potential is given immediately by (7.81), with q = ze. 

The calculation also provides a rough indication of the screening effect 
of all the other electrons on the field of any one of them — in other words, 
of the way in which Coulomb correlations reduce the effective range of the 
interaction among electrons. This problem will be dealt with more fully 
in Chapter 10, and it will be seen that the Thomas-Fermi result exaggerates 
the screening effect, that is, it gives too small a screening distance. 


CHAPTER 8 


BLOCH FUNCTIONS AND BRILLOUIN ZONES 


8.1. Motion of an Electron in a One-Dimensional Lattice: Bloch’s theorem 


In the free-electron model the periodic electrostatic field present in an 
actual metal is neglected. In order to obtain some idea of the effects of the 
periodic field we will first discuss the idealized problem of the motion of a 
single electron in a one-dimensional crystal lattice; the potential energy 
V(x) being a periodic function of x, the 
distance measured along the lattice, such 176 ' 
as that shown in Figure 8.1. We will take — 
the period to be a and the length of the 
lattice to be L = Ga , where G is an integer. 

Periodic boundary conditions are applied, 
so that, as in § 7.3, we may imagine the 
lattice bent round until the two ends coin¬ 
cide, to form a ring. 

Now, when the potential energy is constant, we know that the wave 
function of an electron is of the form 


I I 

I I 



I 


Fig. 8.1. Potential energy of an 
electron in a one-dimensional 
lattice. 


M*) - e iftJC , (8.1) 

where 


k — 


2ng 
L ’ 


9 = 0 , + 1 , ± 2 , . . .. 


Taking the periodic field into account, we should expect the wave function 
to be of similar type, but modulated by the field. Since the assumption of 
periodic boundary conditions implies that surface effects, or, in this case, 
end effects, are neglected, we should expect the modulating function to 
have the period of the lattice — in other words, it is expected that a wave 
function will have the form 


Ux) = e ifc X«, 

196 


( 8 . 2 ) 
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where the function u k (x) has the period of the lattice, that is, 

«k(x+a) = u k (x), (8.3) 

and in general depends upon k. 

The proof that the wave functions of an electron in a periodic field 
actually are of this type was given by Bloch in 1928, and is known as Bloch’s 
theorem in metal theory, although it is mathematically the same as a much 
earlier theorem due to Floquet. The functions (8.2) are called Bloch functions. 
We will now give an elementary proof of Bloch’s theorem. 

We wish to find the general form of the acceptable solutions of the 
Schrodinger equation 

^ + 0, (8.4) 

dx n 

where 

V(x) = V(x + a), (8.5) 

subject to the periodic boundary condition 

\//(x) = \]/(x+Ga). (8.6) 

If, as described above, we imagine the lattice bent round to form a ring, 
it is clear from the symmetry of the ring that the probability density must 
be a periodic function of period a. In other words, we must have 

l<M*)l 2 = |«K*+a)| 2 , 

which means that 1 


t This result may be obtained without any reference to the probability density in the 
following way: 

Equation (8.4) may be written 

Htp(x) = ey)(x), 


where 


h 2 d 2 

"=- 2 ^ +KW - 


Now, if r is an operator which changes x into x+a , so that 


then 

r h 2 d 2 i 

m *,) =L-^ + ^+«)J v(*+«> 

= Hxp(x+a) = HFip(x), 

owing to (8.5) and the fact that d/d (x+a) = d/dx. F thus commutes with H , so that the 
two operators have simultaneous eigenfunctions (see § 2.6), and we may therefore write 


ryj(x) = l\p{x) = y)(x+a). 
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\}j{x + d) = X\j/{x), (8.7) 

where A is a constant. Also, 

\p{x+2a) = X\p(x + a) = X 2 \p(x), 
and, if g is any integer, 

\J/(x+ga) = X g \j/(x). 

It follows from this and equation (8.6) that 

i^x+Ga) =■■ X°rj/{x) = ij/(x). 

Hence, 

X G = 1, 
or 

X = e 2,ti0/c , where g = 0, ±1, ±2. 

The wave functions must therefore satisfy the so-called Bloch condition 


\f/(x+a) = e 2n,g/G if,(x). 

(8.8) 

If k is defined as in equation (8.1), 


i 2 ?zg 

k = 9 = 0, ±1, ±2,..., 

Ga 

(8.9) 

then the functions 


K(x) = e ikx u k (x) 

(8.10) 

satisfy (8.8), provided that 


u k (x + a) = u k (x). 

(8.U) 


Since we have made no assumption about u k (x), apart from its periodicity, 
this result is quite general, and we may assume that the wave functions 
are of the form (8.10), which is Bloch’s theorem. 


8.2. Brillouin Zones in One Dimension 

It is easily seen that k is not uniquely specified by (8.10) and (8.11), 
for we may write 

e>k * u k( x ) = ^ k+2nn,a)x le- i2nnx/a u k (xJ]} (8.12) 

where n is any integer, and the function in square brackets still has the 
period of the lattice. This means that we may replace k by k+(2nn/a) 
without destroying the form of the Bloch function. It is therefore possible 
to restrict the values of k to an interval of length 2 nja, which for con- 
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venience we take to be symmetrical about k= 0 — that is to say, we choose 
k to he in the interval t 


(8.13) 

a a 

which we will call the fundamental domain of k. 

The wave functions are then written 


<M*) = e ikx u u(x), (8.14) 

where 

u kl (x + a) = u kl (x), (8.15) 

and there is an infinite number of states, labelled by the quantum number 
, corresponding to any value of k in the fundamental domain. Substituting 
(8.14) in (8.4), and dividing by e 1 **, we find that u kl must satisfy the dif¬ 
ferential equation 


d 2 u k , ... d u kl 2m 
—f +2i fc—+ — 
dx 2 dx h 2 


h 2 k 2 

Si(k)-^--V(x) 

2m 


u kl = 0. 


(8.16) 


The complex conjugate of this equation, assuming V(x) to be a real 
function, is 


d_!^_ 2ifc ^ + 

dx 2 dx h 2 


* 2m 


h 2 k 2 

e^k)-^--V(x) 

2m 


u* = 0 , 


(8.17) 


and, changing A: to —A in (8.16), the equation for u_ fcI 
d 2 u_ t 


-kl 


dx 2 


... d u~ kl 2m 

— 2\k -- + — 

dx h 2 


h 2 k 2 

2m 


u _ 


ki 


is found to be 
= 0. (8.18) 


Comparing equations (8.17) and (8.18), we deduce that 

««(*) = w-fcj(x), (8-19) 

and 

e,(/c) = £j(—A). (8.20) 

The functions ij/ kl and t J/_ kl are therefore degenerate. 

A very small change in k in equation (8.16) will result in only a small 
change in u kl and in e^k). Hence, if a given label / is properly ascribed, it 


t Since k = Ing/Ga, where g is an integer, it is evident that the end points of this 
interval, ±n/a, will only be included if G is an even integer. Furthermore, since the end 
points differ by 2 n/a, it is only necessary to include one of them in the domain of k , 
the choice being arbitrary. For a real metal, however, G is a very large number, so that 
k is almost a continuous variable, and the question of the inclusion or not of the end 
points is of no practical importance. 
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will specify a set of states, one for each k value, for which the energy e t (k) 
is a quasi-continuous function of k. This set of states is called a Brillouin 
zone. The nomenclature in this part of the subject is by no means universal, 
however, and it should be noted that what we have called the fundamental 
domain of k is more often referred to as the basic Brillouin zone, or, for 
reasons which will appear, the first Brillouin zone or the reduced zone. 

As a simple example, let us consider the free-electron model, even though 
the introduction of a lattice periodicity is rather artificial in this case. It 
may be verified directly by substitution in the Schrodinger equation (8.4), 
with V(x) = 0, that the wave functions and corresponding energies may 
be written 



( 8 . 21 ) 


<Mx) = e , *V 2 " w “, 


,,, h /, 2 %IY 

e ‘ (k) = £ r T ’ 


n 

- < k < 0. 

a 


( 8 . 22 ) 


These wave functions are of Bloch form, and all possible states are in¬ 
cluded by letting / have the values 0, 1, 2. .., each value specifying a 
Brillouin zone. The curves of sfik) 
against k are shown in Figure 8.2 (a). 

It is seen that, in this case, the energy 
curves of consecutive zones touch either 
at k = 0 or at k = ±n/a. The reason 
for this is quite obvious from the nature 
of the free-electron model — we know 
that the energy distribution is quasi- 
continuous. However, this is not so 
when a non-zero periodic field is present. 

Figure 8.2 (b) shows the general effect 
of a small field with period a (we shall 
consider this in detail in the following 
section). The energy curves for the 
different zones are now smooth as well 
as continuous, and those of adjacent 
zones no longer touch — there are finite 



Fig. 8.2. Energy of an electron in a 
one-dimensional lattice as a function 
of k: (a) zero potential energy (free 
electron), (b) small periodic poten¬ 
tial energy. 


energy gaps, such as AB, CD, EF, between the zones, and no state has 
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energy lying within these gaps. The energy distribution is thus no longer 
quasi-continuous, but is broken up into quasi-continuous bands of energy 
levels separated by forbidden energy regions. Although Figure 8.2 (b) 
shows only small energy gaps, and energy curves which do not differ much 
from the free-electron curves, this is by no means necessary, but only applies 
when the periodic field is small. If the periodic field is of large amplitude, 
the energy curves may deviate considerably from the free-electron curves, 
but, none the less, their general characteristics will be as shown. 

We have mentioned that the introduction of a lattice periodicity is rather 
artificial in the case of free electrons, and the use of the Bloch functions 
in equations (8.21) and (8.22), although correct, is unnecessarily tedious. 
Since the potential energy is constant there is no reason for choosing one 
period rather than another — in fact, here it is more natural simply to use 
the wave functions (8.1) without restricting the range of k. The one-electron 
energies are then given by 

h 2 k 2 

e(k) = —~, (8.23) 

2m 

and the energy curve is the simple parabola shown in Figure 8.3 (a). Even 
when a periodic field is present it is possible to use a similar representation, 
that is, one in which the range of k is not restricted. Since we know that the 
form of a Bloch function is unaltered when k is replaced by k + (2nn/a), 
where n is any integer, positive or 
negative, it follows that any energy 
curve in Figure 8.2 (b), or segment 
of such a curve, may be translated 
through a distance Inn/a parallel to 
the k axis and the result will remain 
correct. In particular, the right and 
left halves of the energy curves may 
be translated separately, so as to give, 
by analogy with the free-electron 
parabola, the energy curve shown in 
Figure 8.3 (b). For example, the 
segments BC and B'C are translated 
a distance In/a to the left and to the 
right respectively. This is called the 
extended zone scheme, the first Bril- 
louin zone now corresponding to values of k between —nja and nja, 


£(k) 


£(k) 



(a) (b) 

Fig. 8.3. Energy of an electron in a one¬ 
dimensional lattice, according to the ex¬ 
tended zone scheme: (a) zero potential 
energy (free electron), (b) small periodic 
potential energy. 
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the second Brillouin zone to values of k lying between —2n/a and 
— nla as well as between 7t/a and 2n/a, and so on. On the other hand, 
the situation represented in Figure 8.2 is often called the reduced zone 
scheme — clearly, it is possible to start with the extended zone picture 
and reduce it to that shown in Figure 8.2 by suitable translations through 
2nnla, which is the approach often adopted in elementary treatments. 
This explains the alternative names given to the fundamental domain 
of k. 

The reduced and extended zone schemes are equally valid, and which 
one is used in a particular problem is largely a matter of convenience. 
Although the extended zone scheme appears to be the more natural when 
one considers free, or nearly free, electrons, we wish to emphasize that 
this is by no means so in general. In solving the Schrodinger equation 
for an electron in a periodic field one assumes solutions of the form (8.10), 
and this leads naturally to e{k) being found as a many-valued function 
of k, the different values of which ve have distinguished by means of the 
subscript /. If no restriction is put upon k, therefore, the energy curves 
which one obtains by straightforward calculation are not immediately 
those of the extended zone scheme, Figure 8.3 (b), but those shown in 
Figure 8.4. The essential thing to notice is that the energy is a periodic 


m 



Fig. 8.4. Energy of an electron in a one-dimensional lattice as a many-valued 
function of k, when k is unrestricted. 

function of k, with period 2n/a. All possible states are included in a single 
period, which is the justification of the reduced zone scheme. The choice 
of the fundamental domain of k to be from — nja to n/a is purely one of 
convenience, however, and is due to the fact that the energy curves are 
symmetrical about the origin — it is important to realize that any interval 
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of length 2n/a would suffice. The extended zone scheme is obtained by 
choosing one particular energy curve in each half period — in the case 
of nearly free electrons, we choose the one which lies closest to the free- 
electron parabola, and, in general, we choose the one which makes e a 
monotonic increasing function of |k|. 

The energy curves are continuous and smooth (the free-electron case 
excepted), and it follows from their periodicity and symmetry about k = 0 
that they must all have zero slope at k = 0 and k = ±nja. Thus, owing 
to the periodicity, we have 



and, in particular, 

/deA = /dsA 

\dk). n/a \dkj n/a ’ 

while, owing to the symmetry about k = 0, we have 


and 


e,(/c) = £;( — k), 


7 e i ( k ) = “ J7 e i(~ k y 
d k d k 


From (8.25) and (8.26), we have, finally, 



It also follows from (8.26) that 



(8.24) 


(8.25) 


(8.26) 


(8.27) 


According to equation (8.9) there is an interval InjGa between one k 
value and the next. In the fundamental domain of k, therefore, there are 


2n Ga 
a 2n 


k values, and this is the number of orbital states in each Brillouin zone. 
Before going on to a more quantitative discussion of energy gaps it will 
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be useful to prove the following theorem: 

if F(x) has period a, then 


f e ifc *F(x)dx = 0, 
Jo 


(8.28) 


unless k = 2nn/a, where n is a positive or negative integer or zero. 

Here, as before, L = Ga, so that, if q denotes an integer, we may write 

rL G- 1 r(q+l)a 

e ifcx F(x)dx = e ikx V(x)dx, (8.29) 

Jo q- 0 J qa 

which just breaks the range of integration up into G intervals of length a. 
Also, writing x = X+qa, we have 

/*0+l)a ra 

e ikx V(x)dx = e ik(X+qa) V(X + qa)dX 

J qa Jo 

= e ik - f t ikX V(X)dX, (8.30) 

J 0 

owing to the periodicity of F(x). It is immaterial whether we use X or x 
as the variable of integration in (8.30), since this is a definite integral. 
Hence, the same integral is a factor of every term in (8.29), which becomes 

Ve^fe^F^dx. (8.31) 

9 = 0 J 0 


Now, 


G — 1 


£ e ikqa = l+e ika + e i2ka + .. . +e i ( G - 1 ^ 

9 = 0 


1— e iGfca i _ e' 2 * 9 


= 0, 


(8.32) 


1 — e ifca i _ e i2 ’ l9/G 

unless Q ,ka = 1, that is, unless k = 2nn/a, where n is a positive or negative 
integer or zero. In the latter case, each term of the series is unity, and the 
sum is G. This completes the required proof and shows, in addition, that 

e i2nnx/a F(x)dx = G e i2nnx /° K(x)dx, (8.33) 

whjch is otherwise obvious, since the integrand has period a. 

It follows immediately, if </^ fc (x) and i/v(x) are Bloch functions defined 
by (8.10), that 


f t A*0 c ) V(x)\j/ k ,(x)dx = 0, 
J o 


(8.34) 
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unless k' — k = Inn!a, for this integral is 



and is of the same form as that in equation (8.28),' since V(x)u*(x)u k ,(x ) 
has period a. 

Also, if iA fci (x) and t/vr(x) are Bloch functions defined by (8.14) in the 
reduced zone scheme, then 

f ^uix)V(x)\j/ kr (x)dx = 0, (8.35) 

J o 

unless k = k ', since, when k and k' are restricted to the fundamental 
domain, the only permissible value of n is zero. 

8.3. Energy Gaps in One Dimension by Perturbation Theory: Nearly Free 


Electrons 


We will now calculate the energy gaps for a one-dimensional lattice 
on the assumption that the periodic potential energy V(x ) is of very small 
amplitude. We may then treat V(x) as a small perturbation applied to a 
free-electron system. 

Since we are starting with free-electron functions it is convenient in this 
case to use the extended zone scheme — in other words, we take the un¬ 
perturbed wave functions, normalized in the length L of the lattice, to be 
given by 



(8.36) 


where k is not restricted to the fundamental domain. 

Now, the unperturbed functions ip k (x) and ij/- k (x) are degenerate, both 
corresponding to the unperturbed energy h 2 k 2 /2m, so let us apply the first- 
order perturbation theory for a doubly degenerate state described in 
§ 3.7. The functions are orthonormal in the length L, so that, if s' is the 
perturbation energy, the secular equation, from (3.85), is 



(8.37) 


Heret 


t The quantities V kk , V k _ k , etc., are the matrix elements of the potential energy with 
respect to the unperturbed wave functions. 
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and 



(8.38) 


(8.39) 


(8.40) 


Equation (8.38) shows that V k k , which is the same as V_ k _ k , is just the 
average of V(pc) over the lattice, and we may choose our zero of energy 
so that this average vanishes. The secular equation then becomes 

s' 2 = = (8.41) 

From (8.28) we see that V ki is, in fact, zero unless k = nn/a, where n 
is a positive or negative integer or zero, and in this case V k> . k becomes 

K = 7 f e~ i2nnx/a V(x)dx = - f e~ i2nnx/a V(x)dx, (8.42) 

LJ o aJ 0 

by (8.33). First-order perturbation theory thus predicts no splitting of the 
degenerate energy levels, except when k = nuja, and here the unperturbed 
level splits into the two perturbed levels given by 



The energy gap at k — un/a, that is, between zone |«| and zone |/z| +1 
in the reduced zone scheme, is therefore 


Ae(nn/a) = 2\V n \. (8.43) 

Of course, although energy gaps only occur at k = nn/a, the energy levels 
for other values of k must be displaced, so that the final energy curve is 
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like that in Figure 8.3 (b), but this displacement is not given by first-order 
perturbation theory. 

The quantities V n are just the Fourier coefficients of the potential energy, 
that is, the coefficients appearing in the complex Fourier series 

V(x) = f Vn e i2 " nx,a . (8.44) 

n= — oo 

It may be verified by the method described in Appendix 2 that, if a function 
F(x) with period a is expanded in such a series, the coefficients are given by 
(8.42). 

The perturbed wave functions .at k = nnja are linear combinations of 
the unperturbed functions, and so have the form 

A 1 e nKXla +A 2 z- innxla . 

Considerations of symmetry, or direct calculation by perturbation theory, 
show that A x = +A 2 , and the perturbed functions are proportional to 

. nnx . nnx 

sin- and cos-. 

a a 


Thus, the perturbed functions at the zone boundaries are no longer running 
waves, but are standing waves. It is clear why such standing waves should 
in general have different energies, for the maxima of their squared moduli 
do not coincide. If V(x) has troughs at x = 0, a, 2a, ..., as shown in 
Figure 8.5, then the cosine function will always have the lower energy, for 



Fig. 8.5. Potential energy function 
for an electron in a one-dimensional 
lattice and probability densities for 
states at the boundaries of the first 
Brillouin zone. 



Fig. 8.6. Reflection of a 
beam of electrons from 
a crystal. 


cos 2 {nnx I a) has maxima at the points of lowest potential energy, whereas 
sin 2 (nnx/a) has minima. The squares of the two functions with n = 1 are 
also shown in Figure 8.5 — the cosine function corresponds to the point 
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A (or A') in Figure 8.2 (b), and the sine function to the point B (or B'). 

There is a correspondence between the occurrence of energy gaps and 
the condition for the Bragg reflection of electrons (or X-rays) from a 
crystal, which is worth noting. If a beam of electrons (or X-rays) with 
wavelength X is incident at an angle 0 to a face of a crystal, and d is the 
distance between planes of atoms parallel to that face, as shown in Figure 
8.6 the beam will be reflected provided the Bragg condition 

nX = 2d sin 9 (8.45) 

is satisfied. This is because electron waves reflected from successive planes 
of atoms will be in phase, and hence reinforce one another, when this 
condition is satisfied. If we have only a one-dimensional lattice, or, what 
amounts to the same thing, if a beam of electrons is incident normally 
on a face of a crystal, and the lattice spacing is a, then the condition for 
reflection is 

nX = 2a, (8.46) 

from (8.45), 9 being n/2 in this case. Since X = 2n/k, this condition becomes 

k = nn/a. 

This is in agreement with the theory of energy gaps developed above, 
for no electron can move through the lattice with energy lying in an energy 
gap, and these occur at k = nn/a. Hence, if a beam of electrons with such 
energy is incident on a lattice it must be totally reflected. 

8.4. The Tight Binding Approximation in One Dimension 

Let us first consider an electron in an isolated potential well U(x), such 
as that shown in Figure 8.7 (a), which we may regard as the potential 
energy of the electron due to a one-dimensional ‘atom’. We will fix our 
attention upon a particular bound state of this electron, whose wave 
function y(x) satisfies the Schrodinger equation 

d 2 y 2m r , T . ... 

^2 + i>0- = 0, (8.47) 

and falls exponentially to zero as x tends to + oo. If it is the ground state 
wave function, x(x) will have a form similar to that shown in Figure 8.7 (a), 
but there is no reason why x(x) should not be the wave function of an 
excited state, and hence have nodes. We will assume, however, that x(x) 
is non-degenerate, which is generally the case in one dimension, and also 
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that U(x) is an even function, that is, 

U(x) = U(-x). (8.48) 

Now, if a number G of such one-dimensional atoms is arranged on a line, 
with a distance a between one atom and the next, the result is a lattice of 
the type discussed above, and the potential energy V(x ) of an electron in 
the lattice is just the sum of the potential functions of all the atoms, namely, 

= I U(x-la), (8.49) 

i 

where / ranges over G successive integers t. 



Fig. 8.7. (a) Potential energy U(x) and wave function 
%(x) of an electron in an isolated one-dimensional atom. 

(b) Potential energy V(x) of an electron in a lattice of 
such atoms, and the overlapping atomic functions. 

The Schrodinger equation for an electron in the lattice is thus 

+ Ti [ fi ~ £ U(x-lay]\J/ = 0. (8.50) 

dx ft i 

If the lattice spacing a is so large that there is little overlap between 
the atomic wave functions of different atoms — more specifically, if the 
product U(x—la ) y(x—na) is small except when / = n — then it is easily 
seen that any of the functions x(x —/a) is approximately a solution of equa¬ 
tion (8.50). The equation satisfied by x(x— la) is 

d ,^-U) v( u) = 0j (g. 51) 

dx z ft 


f In fact, if G is a finite number, V(x) as given by (8.49) is not strictly periodic. However, 
we assume that G is a very large number, so that the departure from periodicity is only 
noticeable at the extreme ends of the lattice, and its effect is completely negligible — 
this is consistent with the discussion of the field in a real metal given at the beginning 
of Chapter 7. 
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which is what equation (8.50) becomes, with s = e 0 , when we drop all 
terms in the potential energy except U(x — la). Since all the functions 
X(x — la ) are equally good approximate solutions when overlap is neglected, 
we expect the proper solution of (8.50), in the case of small but finite over¬ 
lap, to be very nearly a linear combination of these functions, that is, 

<K X ) = Z CiX(x~la), (8.52) 

i 

approximately, where the constant coefficients c ( are to be determined. 
This is called the tight binding approximation in metal theory, and is just 
the LCAO (Linear Combinations of Atomic Orbitals) approximation 
described in connection with the hydrogen molecule in § 5.8. 

The coefficients c t may be obtained by the variational method, but this is 
not necessary, for they are given immediately by the requirement that the 
wave function satisfy the Bloch condition. Thus, if we introduce the wave 
number k , defined as before, and write 

<M*) = Z e ikla x(x-la), (8.53) 

i 

we see that 

< h{x + a) = Ze ift,a x[x — (/ — l)a] 

l 

= e ika M*)l (8.54) 

for a sufficiently long lattice, as required. The function (8.53) is thus a 
Bloch function and is of the general form (8.52). 

If tve assume that the atomic orbitals are normalized in the length of 
the lattice (or, nearly enough, in infinite length), and ignore overlap between 
those on different atoms, then 

j X*(x-na)x(x-la)dx = 3 nl , (8.55) 

and 

J 'I'k'l'kd* = Z Z e'* (i-n)<J J x*(x—na)x(x—la)dx 

= ZZ e iHl ~ n)a S nl = G, 

n l 

all integrals being over the length of the lattice. The normalized approximate 
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wave function of the state k is, therefore, 

Hx) = G-±'£t mo X {x-la), (8.56) 

and the corresponding approximate energy is 

t » 

e(k) = j ij/*Hif/ k dx, (8.57) 

where 




(8.58) 


Now, 

Hijs k = G-±Ze ikla Hx(x-la) 


= G~* £e 


i kla 


h 2 d 2 
-- + U(x — la) 


x(x — la) 


2m dx 2 

+ G ~i £ e ikla [V(x)~ U(x-la)] X (x-la) 

l 

= e o'/'k + G~ i £e' kla [V(x)~ U(x-la)]x(x-la), (8.59) 

using (8.51), so that 

e(k) = JV*e 0 ij/ k dx 

+ G ~ 1 £ X e ,,t(i_n)0 U(x — lay]x(x— la)dx. (8.60) 

n l J 

Writing X = x — la, we have 


J X*(x—na) [V(x)-U(x-la)] x(x-la ) dx 

= j xW+ma) (K(J+/<!)- 6'U'j] /(A") dX 

=.J X^X+ma) [V(X)~ l/(*)] *(*) dX, 

where m = l—n. Strictly speaking, the limits of integration with respect 
to X should be different for each value of /, and hence vary with m and n, 
but for a very long lattice, and consistent with the use of periodic boundary 
conditions, this fact may be neglected and all the integrals taken over the 
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same range. We then have 

e(k) = e 0 + £ e ' kma f X*(x+ma)[V(x)- F(x)]x(x)dx. (8.61) 

m J 

Our initial assumption was that overlap between different atomic orbitals 
is small, so we will neglect all the integrals in (8.61) except those for which 
m = 0 or ±1 — that is to say, we will only take into account overlap 
between nearest neighbouring atoms. We will also write 

« = - [ X*( x ) [V(x)- U(x)] x(x) dx, 

7 = ~ j X*(x+a) [V(x)~ £/(x)] x(x) dx, 

and it is easily verified that, as U(x) is an even function, and y(x) is either 
even or odd, 

7 = ~ J X*(x-a) [V(x)~ £/(x)] x(x) dx 
also. This gives, finally, 

s(k) = £ 0 — a — 2y cos ka. (8.62) 

We thus see that the atomic energy level e 0 splits up into a band of energy 
levels in the lattice, and, as cos ka lies between —1 and 1, the width of 
the band is Ay. It will be noticed that all possible states in this band, or 
Brillouin zone, are represented by values of k lying between —iija and 
n/a, s being a periodic function with period Infa (compare Figure 8.4). 

Now, if U(x) is negative, F(x) — U(x) is 
also negative, so that a is positive. Further¬ 
more, if y(x) is an even function of the 
type shown in Figure 8.7 (a), y(x) and x( x +a) 
have the same sign, so that y is positive. 

In this case, the energy curve is that shown 
in Figure 8.8, which resembles the curves 
for l = 0 or / = 2 in Figure 8.2 (b). On the 
other hand, if y(x) is an odd function with 
one node at the origin, as it might be for Fi §- 8 - 8 - Energy as a function of k 

the first excited state, a is still positive, but for one - dimensional binding 

, . example. 

y is negative, so that the energy curve now 

has a maximum at k = 0 and resembles the curve for / = 1 in Figure 
8.2 (b). 
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There is an important difference between the curve of Figure 8.8 and 
the lowest curve, say, of Figure 8.2 (b), which represents the nearly 
free electron approximation. In the former, the segment between k = 0 
and k = n/a, for example, has a symmetry about k = n/2a which is not 
present in the latter curve, this being almost a parabola except near 
the zone boundaries. 

In principle, every atomic level will broaden into such a band of levels, 
so that by taking all the atomic levels in turn we would get an infinite set 
of energy curves, as in Figure 8.2 (b), the equation of each being (8.62), with 
appropriate values of e 0 , a, and y. The approximations we have employed 
must be borne in mind, however. In particular, we have assumed that the 
atomic wave functions overlap very little, which implies that y, and hence the 
energy band width 4y, must be small. On the other hand, the construction 
of the Bloch functions from the single atomic function %(x) associated 
with the energy level e 0 implies, not only that this level is non-degenerate, 
but that it is isolated from any other atomic energy levels. The simple 
method we have described is thus appropriate only when it leads to 
narrow energy bands with wide energy gaps between them. The perturbation 
method described in the foregoing section, however, is appropriate only 
when the energy bands are wide and the energy gaps narrow. 

The method can easily be extended to degenerate levels. Instead of the 
single function x, a linear combination of the degenerate functions associated 
with a given level is used to construct Bloch functions similar to (8.53), 
the coefficients being determined by the variational method. Mathematical 
techniques have also been developed to facilitate the extension of the method 
to cases where overlap is considerable and involves' more than nearest 
neighbours, but the computational labour involved in obtaining accurate 
results in such cases generally makes other methods preferable. 

We may expand cos ka in equation (8.26) and, when k is very small, 
retain only the first two terms in the expansion. This gives 

e(k ) = e 0 — a— 2y + ya 2 k 2 , (8.63) 

which, apart from the position of the zero, is of form similar to that of the 
free-electron energy (8.23). To point this similarity we sometimes write 

ya 2 — h z l2m*, (8.64) 

where m* is known as the effective mass of an electron near k = 0. 

So far, in this chapter, we have been concerned essentially with the pure 
mathematical problem of the solution of the one-dimensional Schrodinger 
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equation (8.4) for a single electron in a periodic field. This solution can be 
obtained to any degree of accuracy using numerical methods, but the labour 
involved is generally considerable, which is why we have confined ourselves 
to the two important approximations discussed above. Exact solutions 
have been obtained by simple analysis only when the potential function is 
either constant (free electrons) or consists of equally spaced delta func¬ 
tions t (Kronig and Penney model). The latter case is rather artificial and 
tells us little that we have not already established, so we will omit it — 
accounts of it may be found in many of the works listed in the Bibliography. 

In extending these ideas to real metals it must be remembered that many 
electrons are present, and these contribute substantially to the field acting 
upon any one of them. It is clear that here again Coulomb correlations 
are going to play an important part. The effect of these correlations, in fact, 
is such that Bloch functions do not always provide the best description of 
the electronic motion. For example, we expect Bloch functions to be 
appropriate for the valence electrons in a metal, but not for the electrons 
in the very tightly bound inner shells. The reason for this should be clear 
from our discussion of the relative merits of the Heitler-London and 
molecular orbital methods in § 5.8. For the closed inner shells the distinc¬ 
tion, in fact, is not important, for it can be shown that in this case the 
Heitler-London and Bloch methods lead to the same results. It is less easy 
to decide whether atomic or molecular orbitals are more appropriate for 
the electrons in, say, the 3d sub-shells of the transition metals. 

In discussing the energy bands in metals we shall generally have in mind 
the electrons which He outside the closed shells of a free atom, and shall 
assume that the Bloch method is appropriate. 

8.5. The Crystal Structures of Metals 

In a metal, as in other soUds, the nuclei of the atoms are arranged in a 
periodic spatial array, or crystal lattice, which may be thought of as con¬ 
structed from identical unit cells , each containing one or more atoms. 
For the simplest kind of crystal lattice, called a Bravais lattice, the unit 

t The delta function 8{x) may be defined as follows: 

<5(x)'= 0 for x and f* (50) dx = 1, 

J —a 

where a and b are any positive numbers. One may usefully think of d(x) as the limit, 

as c 0, of a function which has value zero outside the interval (—c, c) and value 1 /2c 
within this interval. 
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cell may be taken to be a parallelepiped containing one atom, as shown in 
Figure 8.9. It is convenient, although not necessary, to take the position 
of the atom (or more precisely, the nucleus of the atom) to be at a corner 
of the cell; then, if the edges of the cell are specified by the vectors a lt 
a 2 , a 3 , any atomic position, or lattice point, may be obtained from any 
other by a translation through a lattice vector R n , given by 

K = n 1 a l +n 2 a 2 +n 3 a 3 , (8.65) 

where n x , n 2 , and n 3 are integers. A more complicated lattice may be 
described as a Bravais lattice with a basis; that is to say, the unit cell contains 
more than one atom and the basis gives the positions of the atoms within 
the unit cell. In this case only the positions of similarly situated atoms 
within two unit cells differ by a translational vector of the form (8.65). 



Fig. 8.9. A Bravais Lattice. The thicker lines indicate a unit cell. 

The most elementary crystal structure is the simple cubic lattice, which 
is a Bravais lattice whose unit cell is a cube. If e x , e 2 , e 3 are, as usual, unit 
vectors defining rectangular Cartesian coordinates, and we take these to 
be along the cube edges, then 

a = ae x , a 2 = ae 2 , a 3 = ae 3 , (8.66) 

where a is the distance between nearest neighbouring atoms. Although no 
metal crystallizes in this structure, we shall sometimes use it as a simple 
model of a three-dimensional lattice. 

Most metals have crystal structures of one of the following types: (i) body- 
centred cubic, (ii) face-centred cubic, (iii) close-packed hexagonal. The 
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first two are simple Bravais lattices, having rhombohedral unit cells, but 
the last is not — it may be described as a Bravais lattice with a basis, its 
unit cell containing two atoms. Even in the case of the cubic structures, 
as the name implies, it is generally more convenient to take the unit cell 
to be a cube, containing several atoms — two for the body-centred cubic 
lattice, and four for the face-centred cubic lattice. In other words, we take 
each of them to be a simple cubic lattice with a basis. Let us consider 
these important structures in more detail. 

(i) The body-centred cubic lattice. For this 
structure the unit cubic cell has atoms at 
the corners and one at the centre, as shown 
in Figure 8.10. Seven of the comer atoms 
are associated with neighbouring cells, 
however, so that there are only two atoms 
per unit cell. The lattice may be thought 
of as two interpenetrating simple cubic 
lattices, the lattice points of one being at 
the centres of the unit cells of the other. 

On the other hand, if e t , e 2 , e z again 
denote unit vectors along the cube edges, 
and a is the length of an edge, the lattice 
may be described as a Bravais lattice, with rhombohedral unit cell, having 
edges 

a i = $a(-e t +e 2 + e 3 ), 

a 2 = \a{e i e 2 -t- £ 3 ), (8.67) 

« 3 = $a(e t+e 2 -e 3 ). 

It may be verified that all the lattice points are then given by equation 
(8.65). The volume of this cell is 

«i • « 2 a a 3 = ^a 3 , 

which is one half the volume of the cubic cell — this is expected, since 
there are two atoms to each cubic cell and only one to each rhombohedral 
cell. 

Each atom has eight nearest neighbours at distance \/3a/2, and six 
second nearest neighbours at distance a. 



Fig. 8.10. The body-centred cubic 
structure. The vectors a v a 2 , a 3 
are the edges of the rhombohedral 
cell of the Bravais lattice. 
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(ii) The face-centred cubic lattice. 


For this structure the unit cubic cell has 
atoms at the corners and at the centre of each 
face, as shown in Figure 8.11. Only four of 
these atoms belong to the unit cell, the others 
belonging to neighbouring cells, and the lat¬ 
tice may thus be considered as four inter¬ 
penetrating simple cubic lattices. Alternatively, 
it may be described as a Bravais lattice, with 
rhombohedral unit cell, having edges 

a i = i a ( e i + e 2)> 

a 2 = 2 a ( e 2 Jre 3)> ( 8 . 68 ) 



Fig. 8.11. The face-centred 
cubic structure. The thicker 
lines are the edges of the 
rhombohedral cell of the 
Bravais lattice. 


a 3 ~ i a ( e 3 + g l)> 


which is also shown in Figure 8.11. The volume of this cell is 

«l'« 2 A«3 = W, 

which is one quarter of the volume of the cube, as expected. 
Each atom has twelve nearest neighbours at distance a/,/2. 


(iii) The close-packed hexagonal lattice 

In this structure the atoms are arranged in equidistant parallel planes, 
in each of which every atom has six equidistant neighbours. The atoms of 
alternate planes are perpendicularly above each other, but the- atoms 
of any one plane are perpendicularly above the centres of the triangles 
formed by groups of three atoms in the next plane, as shown in Figure 
8.12. The lattice obtained by taking alternate planes only is a simple hexagonal 
lattice, so that the close-packed hexagonal structure consists of two inter¬ 
penetrating simple hexagonal lattices. If c is the distance between alternate 
planes and a is the distance between nearest neighbouring atoms in any 
one plane, the lattice may be described as a Bravais lattice (the simple 
hexagonal lattice) with a basis, the unit cell having edges 

a x = ae u a 2 = %ae l +\ y ]3>ae 2 , a 3 = ce 3 , (8.69) 

where the axes e lf e 2 , e 3 are as shown in the figure. The cell contains two 
atoms, one at the origin, and one at 

\ae x +\yjlae 2 +\ce 3 , 
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or, in terms of the basic lattice vectors, at O and i«i+ia 2 +i«3 
basis is generally written (0, 0, 0), (i, i, i), giving 
the oblique coordinates of the atoms in terms of 
the edges of the cell. The volume of the cell is 
a 2 c. 

Now, the close-packed hexagonal structure and 
tne face-centred cubic structure (sometimes called 
close-packed cubic ) represent the two ways in 
which hard spheres may be most densely packed. 

In the close-packed hexagonal lattice each atom 
has six neighbours at a distance a in the basal 
plane through the atom. There are three others 
at distance ^{^a 2 + £c 2 ) in each of the planes on 
either side of the basal plane. In the case of the 
ideal close packing of hard spheres the axial ratio 
cja has the value (8/3)* = 1.63, when all twelve 


The 



Fig. 8.12. The close- 
packed hexagonal struc¬ 
ture. The thicker lines 
are the edges of the unit 
cell. 


neighbours are at the same distance a. We have remarked that in the face- 
centred cubic structure also each atom has twelve equidistant neighbours. 
However, in metals having the close-packed hexagonal structure the axial 
ratio generally differs slightly from the ideal value, the greatest difference 
being in zinc, for which cja = 1.86. Strictly speaking, therefore, in these 


metals each atom has only six nearest neighbours. 

A few metals have other kinds of lattice structures, but we will not 
consider these — the vast majority have structures of the foregoing types. 
Many metals have different structures in different ranges of temperature 
and pressure, a phenomenon which is known as polymorphism. 


8.6. Bloch Functions in a Three-Dimensional Lattice 

Let us consider a crystal lattice whose unit cell has edges a t , a 2 , a 3 , 
and let the dimensions of the lattice be Gqtfi, G 2 a 2 , G 3 a 3 along a x , a 2 , and 
a 3 respectively, G lf G 2 , and G 3 being integers. If it is a simple Bravais 
lattice, then, the total number of atoms is G l G 2 G 3 . The potential energy 
V(r) of an electron moving in this lattice will have the same periodicity 
as the lattice (neglecting surface effects), so that we may write 

V(r+R n ) = V(r), (8.70) 

where R n is defined by (8.65). If there are many electrons present, as in an 
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actual metal, we assume that the potential energy is due to the self-consistent 
field, and satisfies the same condition. 

The basic problem is to solve the Schrodinger equation 

VV+^[£-f(# = 0. (8.71) 

h 2 


As in the one-dimensional case, we assume a cyclic system, and apply 
periodic boundary conditions, which now take the form 

>A0) = ^ + Gi«i) = Hr + G 2 a 2 ) = \j/(r + G 3 a 3 ). (8.72) 

Bloch’s theorem in three dimensions, which may be proved by an extension 
of the method used for a one-dimensional lattice, then shows that the wave 
functions must satisfy the Bloch condition 

'l'k(r + R n ) = e ik ' Rn 'l' k (r), (8.73) 

the wave vector k being defined in such a way that the boundary conditions 
are satisfied. We deduce from this that the wave functions have the form 

Mr) = ^'u k (r), (8.74) 

where u k (r ) has the periodicity of the lattice, that is to say, 

u k (r + R n ) = WfcO). (8.75) 

From the periodic boundary conditions (8.72) and equation (8.75) we 
have 

•AkO) = M r + G i a i) = e iGlk ' fll >Afc(r), 


and similarly for the other two conditions. Hence, 

^Gik-ai _ G 2 k'a 2 _ Q ^G 3 k • a 3 _ j 


so that 


k • a 1 = 






(8,76) 


where g 1} g 2 , and g 3 are integers. 

It follows immediately for a simple cubic lattice, where a x , a 2 , and a 3 
are defined by equations (8.66), that the wave vector k is given by 


k 


2k lg x 

(r- 6l 

a \G X 


02 <02 
H- 02 ^ - e 3 

G 2 g 3 


(8.77) 
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This is not true in general, however. In order to define k in general we will 
introduce what is called the reciprocal lattice. 


8.7. The Reciprocal Lattice: Brillouin Zones 

We define a set of vectors b y , b 2 , b 3 , perpendicular to the coordinate 
planes of the oblique axes a y , a 2 , a 3 , by means of the relations 

a t • bj = 2nd ij (i,j = 1, 2, 3), (8.78) 

where <5 u is the Kronecker delta. We will call the vectors bj the reciprocal 
vectors, and the set of points whose position vectors are given by 

K m = m l b l + m 2 b 2 + m 3 b 3 , (8.79) 

where m 1 , m 2 , m 3 take on all integral values including zero, we will call 
the reciprocal lattice. The reciprocal vectors are, in fact, generally defined 
by equation (8.78) without the factor In on the right hand side, that is, 
as our vectors bj divided by 2n, the reciprocal lattice being defined ac¬ 
cordingly. Here, however, it is convenient to include the factor 2n, and there 
is little point in introducing new names for so trivial a difference. It is 
easily verified that the explicit expressions for the reciprocal vectors are 

b y = — a 2 a a 3 , b 2 = — a 3 a a u b 3 = — a! a a 2 , (8.80) 

1 Q 2 Q Q 

where Q = a y ■ a 2 a a 3 , the volume of a unit cell of the crystal lattice. 

It now follows from equations 18.76) that, in general, the wave vector k 
is given by 

k = ii- b y + — b 2 + — b 3 , (8.81) 

G i G 2 G 3 

so that what we have called ‘k-space’ is the space of the reciprocal lattice, 
and it is consequently referred to as the reciprocal space of the crystal. 
From equations (8.65) and (8.79) we have 

K m • R n = 2n(m 1 n l + m 2 n 2 + m 3 n 3 ), 

and the number in brackets is an integer, so that 

^iK m ' Rn = J 
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The function exp (i K m • r ) thus has the periodicity of the lattice, since 

e iK m • (r + R„) = giKm -r (8.82) 

It may now be seen immediately that the wave vector, k is undetermined 
to the extent of an added reciprocal lattice vector K m , for we may write 
equation (8.74) as 

^ fc (r) = e '(k+K m ) r [ e -iK« *- Ufc ( r )] 5 (8.83) 

and the function in square brackets still has the periodicity of the lattice, 
so that the form of the Bloch function is unchanged. 

Just as in the one-dimensional, case, therefore, we may restrict the k 
vectors to a single cell of the reciprocal lattice, which we will call the fun¬ 
damental domain of k. Solution of the Schrodinger equation will give an 
infinite number of states for each k vector in the fundamental domain, and 
we may label these by a quantum number /; thus, the wave function 

*wM = e“-« H (iO ( 8 - 84 ) 

corresponds to the energy level £j(k). Again, if a given label / is properly 
ascribed, it will specify a set. of states, one for each k vector, for which the 
energy e ; (k) is a quasi-continuous function of k, and this set of states is 
called a Brillouin zone. 

The fundamental domain of k, or basic cell of the reciprocal lattice, 
may be chosen in any number of ways, but it is most convenient to choose 
it to be a polyhedron with as much symmetry as possible about the origin. 
We therefore draw the reciprocal lattice vectors joining the origin, k = 0, 
to the other lattice points, and construct the planes which perpendicularly 
bisect these vectors. We then take the fundamental domain of k to be the 
smallest region t surrounding the origin and bounded by these planes. As 
a rule, only the planes bisecting the position vectors of lattice points which 
are nearest and, perhaps, next nearest neighbours of the origin form the 
surface of the cell. 

The equation of any face of the fundamental domain is easily seen to be 

2 k-K m = |K„| 2 , (8.85) 

where K m is the vector bisected by that face. Thus, from Figure 8.13, if 


t It will be seen in the following chapter that the region defined in this way is just 
the Wigner-Seitz cell of the reciprocal lattice. 
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P is the mid-point of K m , and k is the position vector of any point in the 
plane through P, we have 

OP = ilKJ = k cos e = • 

l & ml 

and (8.85) follows. To each face there is a parallel face, whose equation is 

2k-K m + \K m \ 2 = 0, (8.86) 


the perpendicular distance between the two planes 
being K m . 

As an example, let us consider the simple cubic 
lattice. We obtain, directly from (8.80), or from 
(8.77) and (8.81), 

* 27t . 2 7t , 271 /q o"7\ 

b 1 =—e i , b 2 = —e 2 , b 3 = —e 3 , (8.87) 

a a a 



the reciprocal vectors being parallel to the lattice 
vectors. The fundamental domain of k is thus a 
cube of side 2n/a. The vectors K m bisected by 
the faces are ±b lf ±b 2 , ±b 3 . If k x , k 2 , k 3 are 
the components of k along e x , e 2 , e 3 , respectively, 
the equations of the six faces are 


Fig. 8.13. k is the po¬ 
sition vector of a point 
on the plane which per¬ 
pendicularly bisects K m , 
P being the mid-point 
of K m . 


/Ci = 





( 8 . 88 ) 


which also follow from equation (8.85). 

Whatever the shape chosen for the basic cell of the reciprocal lattice, 
its volume is always given by 

8^.3 

b i ■ b 2 a b 3 = — (a 2 a a 3 ) ■ (a 3 a a x ) a (a t a a 2 ). 

Q 


It may be verified that 

(a 3 A a t ) a (a t a a 2 ) = [«! • (a 2 a a 3 )']a x = Qa x . 


Hence 


b\ ‘ b 2 a b 3 


8 jt 3 

- — a x • a 2 a a 3 


87^ 

Q 


(8.89) 


The volume of the fundamental domain of k is thus 87t 3 times the reciprocal 
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of the volume of a unit cell of the crystal lattice. Also, we see directly 
from equation (8.81) that the number of k vectors in this domain is G l G 2 G 3 , 
and their end points are uniformly distributed (the k vectors describe a 
lattice whose unit cell has edges b 1 /G 1 , b 2 /G 2 , b 3 /G 3 ). The number of k 
vectors with end points in a volume d k of fc-space is, therefore 

G 1 G 2 G 3 f2d/c/87i 3 = vdk/Sn 3 , (8.90) 

where v is the volume of the crystal. This result is the same as we obtained 
for free electrons — which might have been expected, for the distribution 
of k vectors is independent of the potential energy, and must be the same 
when this is zero. 

The number G 1 G 2 G 3 of k vectors in the fundamental domain is also the 
number of orbital states in a Brillouin zone, which is thus equal to the 
number of unit cells in the crystal. Since each orbital state can accommodate 
two electrons, with opposite spins, a Biillouin zone can accommodate a 
number of electrons equal to twice the number of unit cells in the crystal. 
If the crystal is a simple Bravais lattice, so that each unit cell contains 
only one atom, a Brillouin zone can accommodate two electrons per atom 
of the crystal. If the number of electrons per atom is greater than two, 
the electrons must be spread over several Brillouin zones. 

It must be pointed out that what we have called the fundamental domain 
of k is itself generally called the first Brillouin zone or the reduced zone, 
for reasons similar to those given in the case of the one-dimensional lattice. 
Regions of the first Brillouin zone, in this terminology, may be translated 
through reciprocal lattice vectors to form polyhedra symmetrically sur¬ 
rounding the first zone. The bounding planes of these polyhedra still bisect 
the position vectors of reciprocal 
lattice points, so that they are given 
by equation (8.85). The region 
between the first zone and the 
second smallest polyhedron is then 
called the second Brillouin zone, 
and so on, each zone having the 
same volume (see Figure 8.14). This 
is called the extended zone scheme, 
and it permits the energy e{k) to 
be described as a single-valued 
function of k, which now ranges over the whole of fc-space. On the 




V 


(a) (b) 

Fig. 8.14. The first and second Brillouin 
zones for a simple cubic lattice, ac¬ 
cording to the extended zone scheme. 
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other hand, the first method we described, in which k is restricted to 
the fundamental domain, and the energy is an infinitely many-valued 
function of k, is often called the reduced zone scheme. Both schemes 
are equally valid, and, if the sets of states described as the first, second, 
third, . . . Brillouin zones in the reduced zone scheme are those allotted 
to the first, second, third, . .. Brillouin zones, respectively, in the extended 
zone scheme, the correspondence between the two schemes is exact — 
we shall always assume this to be the case. It should be remarked, 
however, that the allotment of states to the various zones in the extended 
zone scheme is not unique, so that the reduced zone scheme must be 
regarded as the more fundamental — this argument has been elaborated 
in the one-dimensional case (§ 8.2). 

8.8. Energy Bands and Energy Gaps 

Within any Brillouin zone the energy is a quasi-continuous function of 
k, but energy gaps generally occur at the zone boundaries, that is, on the 
planes whose equations are (8.85) or (8.86). This may be shown by an 
extension to three dimensions of the perturbation method described in § 8.3. 

The unperturbed wave functions are now the free-electron functions 



(8-91) 


which are normalized in the volume v of the crystal. In one dimension we 
found that energy gaps occurred when the matrix element V ki _ k of the 
potential energy did not vanish. Here the degeneracy is much greater than 
in the one-dimensional case, for all the unperturbed states which lie on a 


sphere of radius k in k-space have the same energy, h 2 k 2 /2m. None the less, 


energy gaps can only occur when some of the off-diagonal elements of the 
secular determinant are different from zero, that is to say, when the k 
vector is such that the matrix element V k k >, where |fc| = \k'\, does not 
vanish. 

The matrix element is 




(8.92) 


the integral being over the volume of the crystal. We will evaluate this 
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by means of a device which has already been used in the one-dimensional 
case (at the end of § 8.2). The integral may be reduced to an integration 
over a unit cell of the lattice by making use of the periodicity of V(r), 
expressed in equation (8.70). Letting r = r' + R n , we find 

V kk . = -E f e i(k _fc) ' (r+Rm) V(r')dr' 

V n ^ cell n 

= i^e i(fc ' _fc) R " f e i(k _fc) ’ r F(i*)dr, (8.93) 

V n J cell 


the summation being over all the cells of the lattice. Now, from equation 
(8.65) and (8.76), we obtain 


(k'-k) R n = 2n 


- Gi 


+ {g'i-Qi)n 2 + (g3-g 3 )»3 "[ 

g 2 g 3 J ’ 


so that V ktk > contains the factor 

Gi — 1 Gz — 1 G3 - 1 

^ gi 2 n(gi' — gi)ni/Gi ^ ^ 2 n{g 2 f ~9z)nzlG 2 ^ ^ 2 n{ 9 3 ~92)1*3^3 

ni = 0 n 2 = 0 n 3 — 0 

and this is zero, unless 

g[ = g’ 2 = g 2 — m 2 G 2 , g' 3 = g 3 — m 3 G 3 , 

where m 2 , and m 3 are positive or negative integers or zero. Thus, using 
equations (8.79) and (8.81), we see that V k<k - is zero unless 

k' = k-K m . (8.94) 


But \k'\ = \k\, so that V k k - is zero unless 

\k\ 2 = \k — K m \ 2 , 


or 

2 kK m = \K m \ 2 , 

which is equation (8.85), defining the bounding planes of the Brillouin 
zones. Equation (8.86) could be obtained equally well by changing the 
sign of K m in (8.94). 

We have seen [equation (8.82)] that functions of the form exp (i K m ■ r ) 
have the periodicity of the lattice, and it follows that 

f e >(K m -Ki)- r^ r _ -f. 


= V, if K m = K,. 


(8.95) 
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The potential energy may therefore be expanded in a triple Fourier series 
of the form 

V(r) = I K m e iKm r , (8.96) 

m 

the sum being over all the positive and negative integral values (including 
zero) of m t , m 2 , and ra 3 . Application of the method described in Appendix 
2 shows that the Fourier coefficient V m is given by 

F m = - f e -iKm ’ r V(r)dr, (8.97) 

vJ 

which is just what (8.92) becomes when condition (8.94) is satisfied. Hence, 
there is a finite energy gap on the plane defined by K m , provided the cor¬ 
responding Fourier coefficient of the potential energy is not zero t. It is 
possible, however, if the lattice is not a simple Bravais lattice, that some of 
the Fourier coefficients may be zero — in other words, that some of the 
terms may be missing from the Fourier series (8.96), as we shall now see. 

Let us suppose that the crystal lattice is a composite lattice, that is, a 
Bravais lattice with a basis. The lattice may then be described as a number 
of interpenetrating Bravais lattices, the number being that of the atoms 
in a unit cell of the composite lattice. As an idealization of the physical 
situation we may assume that the total potential energy V(r) is simply 
the sum of the potential energies due to these Bravais lattices separately. 
Let the unit cell of the composite lattice contain g atoms, whose position 
vectors relative to a corner of the cell are s l} s 2 ,. . . s q (one of these is 
normally chosen to be zero), and let the potential energy of an electron 
due to one of the constituent Bravais lattices only be V'(r). Then 

v(n = t vX'-’ii- ( 8 ' 9 «) 

j= 1 

The Fourier expansion of V'(r) is 

V\r) = Z K m e iKm r , 

m 

and that of V(r ) can therefore be written 

V(r) = t Z t / ;e iKm ‘ (r ' IJ> = Z V m S m e' K " r , (8.99) 

j = 1 m m 

t It must be remembered that first-order perturbation theory has been used here. 
Even when the Fourier coefficient of the potential energy is zero, second-order pertur¬ 
bation theory may give an energy gap, but this will probably be small. 
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where 

S m = f j (8.100) 

j =i 

S m is called the structure factor, and the Fourier coefficients of V(r) are 
given by 

v m = V±S m . 

Clearly, V m and the corresponding energy gap vanish when the structure 
factor is zero, and this is generally so for some K m . 

As an example, let us consider the body-centred cubic lattice as a simple 
cubic lattice with a basis. The first Brillouin zone is then a cube whose 
faces are given by equations (8.88), but it is easily seen that there are no 
energy gaps on these planes. The basis is (0, 0, 0), (^, £), the position 

vectors of the two atoms in the unit cell being s x = 0 and s 2 =$a(e 1 +e 2 +e 3 ). 
Also, from equations (8.87), 

K m = — (m 1 e 1 + m 2 e 2 + m 2 e 2 ), 
a 

and the structure factor is, therefore, 

g _ i _j_ e ~irt(w 1 + m 2 + w.O 

At the surface of the first Brillouin zone the values of (m x , m 2 , m 3 ) are 
(+1, 0, 0), (0, +1, 0), or (0, 0, ±1), for all of which the structure factor 
vanishes, together with the energy gap. The first planes at which there is 
a non-vanishing energy gap are those which bound the second Brillouin 
zone for the simple cubic lattice, as shown in Figure 8.14 (b) — it will 
be seen in the following section that, not unexpectedly, this constitutes 
the first Brillouin zone of the body-centred cubic lattice, when it is constructed 
from the proper Bravais lattice, with rhombohedral unit cell defined by 
equations (8.67). 

The foregoing work shows that along any k direction the energy as a 
function of k is represented by curves like those in Figure 8.2 (b) (reduced 
zone scheme) or those in Figure 8.3 (b) (extended zone scheme). There is 
an important difference between the three-dimensional and the one-dimen¬ 
sional cases, however. In the latter we saw that the energy spectrum was 
broken up into allowed energy bands with forbidden energy gaps between 
them. In the three-dimensional case this need no longer be true, in spite 
of the occurrence of energy gaps in every k direction. This is because an 
energy which falls in an energy gap in one direction may not do so in another 
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direction, so that there may be no energy forbidden to an electron, and no 
gaps in the energy spectrum. We shall see in § 8.12 that this phenomenon, 
which is known as the overlapping of energy bands, or the overlapping of 
zones, is of great importance in the theory of metals. Whether or not the 
bands overlap depends upon the size of the energy gaps, as illustrated in 
Figure 8.15, which shows schematically curves of energy as a function of 
k for two different k directions, according to the extended zone scheme. 
In Figure 8.15 (a) the energy gaps are small and the energy bands overlap. 
In Figure 8.15 (b) the energy gaps are large, and the bands do not overlap, 
so that there is a gap in the energy spectrum. 



Fig. 8.15. A and B are on the first planes of energy discontinuity 
along two lines through the origin of k-space, for which the 
energy curves are the full and broken curves respectively. In 

(a) the energy gaps are small, and the bands overlap; while in 

(b) the energy gaps are large, and the bands do not overlap. Only 
one half of each energy curve is shown, since these are symmet¬ 
rical about k = 0. 


8.9. Brillouin Zones for Some Metallic Structures 

We will now obtain the fundamental domain of k, or first Brillouin zone, 
for each of the important metallic structures described in § 8.5. 

(i) The body-centred cubic lattice 

Substituting a x , a 2 , and a 3 from equations (8.67) in equations (8.80), 
we obtain the reciprocal lattice vectors 

= —(^2 + ^3), 
a 

b 2 =-(e 1 + e 3 ), (8.101) 

a 

, 2n , . 

b 3 -(e x + e 2 ). 

a 
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Comparing these with equations ( 8 . 68 ), we see that the reciprocal lattice 
is face-centred cubic. 

A vector K m is given by 

K m = m 1 b 1 + m 2 b 2 + m 3 b 3 

2,Jl 

= — l(m 2 + rn 3 )e 1 +(m 1 + m 3 )e 2 + (m 1 + m 2 )e 3 ], 
a 

and the bounding planes of the first Brillouin zone bisect the twelve shortest 
vectors K m , for which (m ls m 2 , m 3 ) are permutations of the sets of values 
(1, 0, 0,), (-1, 0, 0), (1, -1, 0). These vectors are 

— (±e 1 ±e 2 ), — (±e 2 ±e 3 ), — (±ei±e 3 ), (8.102) 

a a a 

and equation (8.85) gives, for the equations of the bounding planes, 

+ k 1 + k 2 = —, +k 2 + k 3 = —, + k 1 ±k 3 = —. (8.103) 

a a a 

The first Brillouin zone is thus the rhombododecahedron shown in Figure 
8.16 (a) — as we have mentioned previously, it contains both the first and 
second zones for the simple cubic lattice [see Figure 8.14 (b)]. As there 
is one atom per unit cell of the Bravais lattice, the zone can accommodate 
two electrons per atom of the crystal. 

(ii) The face-centred cubic lattice 

From equations ( 8 . 68 ) and (8.80), we obtain 

, 2n , v 

b 1 — \ e l+ e 2 — e 3)’ 


, 2,71 , x 

b 2 = —( — e l + e 2 + e 3 ), 


(8.104) 


, 2n, v 

b 3 = — { e i~ e 2 + e 3 ), 
a 

Comparing these with equations (8.67), we see that the reciprocal lattice 
is body-centred cubic. 

Using the same method as above we find the shortest K m to be the eight 
vectors 

— (±«i ±e 2 ±e 3 ), 
a 
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and the next shortest to be the six vectors 

471 471 , 4n 

± — e i, + — i — e 3> 

a a a 

The first Brillouin zone is the truncated octohedron shown in Figure 8.16 (b). 
The equations of the hexagonal faces, which bisect the shortest K m vectors, 
are 

+ fci + /c2 + /c 3 = — > (8.105) 

a 


and the equations of the square faces, which bisect the next shortest K m 
vectors, are 


k i = 





a 


(8.106) 


This zone again can accommodate two electrons per atom of the crystal. 


(iii) The close-packed, hexagonal lattice 
From equations (8.69) and (8.80), we obtain 


b 2 




471 

~W * 2 ’ 
V 3a 



(8.107) 


These are the reciprocal vectors of the simple hexagonal lattice and they 
also define a simple hexagonal lattice. The first Brillouin zone is thus the 
hexagonal prism shown in Figure 8.16 (c). The bounding planes bisect the 
vectors ±b 1} ±(b l + b 2 ), ±b 2 , ±b 3 , and their equations are, therefore, 

±ki± k 2 = ^ , k 2 = ± —j=r- , k 3 = ± - . (8.108) 

V 3 3 « V 3a c 

The position vectors of the two atoms in the unit cell of the close-packed 
hexagonal lattice are = 0 and s 2 = + 3 a 2 +i fl 3 * The structure 

factor is, thus, 

S m = i+e" i ’ t « m ‘ + ^ +m 3) j ( 8109 ) 

and this is zero when K m — +b 3 . Hence, there is no energy gap at the 




Ch. 8, § 9, 10] ENERGY SURFACES BY THE TIGHT BINDING METHOD 231 


hexagonal faces, k 3 = ±n/c. The zone can accommodate two electrons 
per unit cell of the crystal, which is only one electron per atom. The region 
containing the first and second Brillouin zones, which we will not consider 
in detail, can accommodate two electrons per atom, but it also contains 
planes of energy discontinuity within it (the rectangular faces of the first 



Fig. 8.16. The fundamental domain of k, or first Bril¬ 
louin zone, for (a) the body-centred cubic lattice, (b) the 
face-centred cubic lattice, and (c) the close-packed hexagonal 

lattice. 

zone). It may be shown that the smallest region bounded on all sides by 
planes of energy discontinuity can accommodate 



electrons per atom. 

8.10. Energy Surfaces by the Tight Binding Method 

When discussing the energy bands in actual metals it will be useful to 
have before us a definite model, even if a somewhat idealized one. The 
essential problem is to find the eigenvalues of the Schrodinger equation 
(8.71), but this cannot be done analytically for any real metal, and accurate 
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results using numerical methods have only been obtained at isolated points 
in fc-space (see the following chapter). We therefore turn to the tight 
binding method to provide a simple semi-quantitative example. Also, we 
will only consider its application to simple Bravais lattices and to energy 
bands derived from non-degenerate atomic s states. 

The method is a direct extension to three dimensions of that described 
in § 8.4, so we need give no more than an outline here. We assume that the 
potential energy of an electron in a free atom is U(r), and that s 0 is the 
energy of an s state with wave function x(r). Thus, %{r) satisfies the Schro- 
dinger equation 

V 1 X+^bo~U(r)]x = 0. (8.110) 

n 

We now consider an electron in a lattice of ions, its potential energy due 
to the ion at R„ being U(r—R n ), so that its total potential energy is 

V(r) = Y. U(r-R.), 

n 

the sum being over all the lattice points, and the Schrodinger equation 
(8.71) becomes 

O Ml 

VV+ -j 0- I U(r-RM = 0. (8.111) 

n n 

On the assumption that the functions x(r — R n ) centred on different lattice 
points do not overlap very much, we construct approximate solutions of 
(8.111) of the LCAO type, 

'l'u(r) = le ik ' Rl x(r-R l ), ( 8 . 112 ) 

i 

the coefficients having been chosen so that the functions satisfy the Bloch 
condition (8.73); thus, 

Ur + Rn) = le ik ' Rl x(r -*, + *„) 

l 

I 

= e ik-R "^ k (r). (8.113) 

If, for purposes of normalization, we ignore overlap between atomic 
functions centred on different lattice points, and assume that these functions 
are normalized in the volume of the crystal (or, nearly enough, in all 


Ch. 8, § 10] ENERGY SURFACES BY THE TIGHT BINDING METHOD 233 


space), the normalizing factor for < p k is easily found to be N~*, where N 
is the number of atoms in the crystal. 

Following the same procedure as in the one-dimensional case [see equa¬ 
tion (8.61)], we find the approximate energy to be 

e(k) = £ 0 + X Q ik Rn f X *(r + Rn)lV(r)-U{r)-\ X {r)dr , (8.114) 

n J 

the integral being throughout the volume of the crystal. If we now neglect 
overlap except between nearest neighbouring atoms, this equation reduces to 

e(k) = e 0 -a-y£e ikRn , (8.115) 

n 

where 

a= - 

y = - j X*(r + R n )[V(r)-U(r)~\x(r)dr, 

and the sum is over the position vectors of nearest neighbours of the atom 
at the origin only — since y(r) is a spherically symmetrical function, y is 
the' same for all nearest neighbours. We may take U(r) to be negative, 
so that V(r)—U(r) is negative, and, for s states with small overlap, a and 
y are positive. 

For the simple cubic lattice the required R n are ±ae±ae 2 , +ae 3 , 
so that 

s(k) = e 0 — a — 2y(cos k t a + cos k 2 a + cos k 3 a). (8.116) 

The energy levels are thus spread over a band of width 12y. It is immediately 
obvious that the energy is a triply periodic function of k, and that all states 
are accounted for if k is restricted to lie within a cube bounded by the 
planes 

k, = ± - , k 2 = ± - , k 3 = ± - , 

a a a 

which is what we have already found to be the fundamental domain of k. 
The normal derivative of e(k ) vanishes on the bounding planes, as may be 
verified by differentiating (8.116); thus, 
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When k is small we may write, approximately, 

e(k ) = e 0 —a —6y + ya 2 k 2 , (8.117) 


and, as in the one-dimensional case, we define the effective mass near 


k = 0 as 


m 


h 2 
2 ya 2 


(8.118) 


Along the k lt k 2 , and k 3 axes the energy is given as a function of k by 
curves like that shown in Figure 8.8. The state of lowest energy is at the 
centre of the zone and the states of highest energy are at the corners. Every 
state of the free atom will give rise to an energy band, but some of them 
will have energy curves with maxima at the centre, similar to that marked 
/ = 1 in Figure 8.2 (b), or at other points in the zone. The simple procedure 
described above applies, of course, only when the atomic state is non¬ 
degenerate and tightly bound. A degenerate atomic state will give rise to 
a number of energy bands or zones equal to the degree of degeneracy. 

Surfaces may be drawn through all points in fc-space representing states 
with the same energy, and these are called energy surfaces. Figure 8.17 
shows three typical energy surfaces for 
the simple cubic lattice, as given by 
equation (8.116). It follows from 
equation (8.117) that, near k = 0, the 
surfaces are approximately spheres, as 
for free electrons, but as they get 
larger they bulge more and more 
towards the centres of the zone faces, 
until eventually they touch* and then 
intersect, the zone boundary. The 
energy surfaces which intersect the 
zone boundary do so orthogonally, 
since the normal derivative of e(k) 
vanishes on the bounding planes. A 
cross-section through energy surfaces 
at equal energy intervals in any one 
of the coordinate planes is shown in 
Figure 8.18. 

It would appear, at first sight, that 
the energy surfaces (a) and (c) in Figure 8.17 are of completely different 



(a) (b) 



Fig. 8.17. Energy surfaces for the 
simple cubic lattice (tight binding 
approximation), corresponding to 
energy (a) near the bottom of the 
band, (b) about half way up the 
band, (c) near the top of the band. 
The origin of space is at the centre 
of the cube, and the k u k 2 , k 3 axes are 
parallel to cube edges. 
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forms, but this is not really so, and is due entirely to our choice of 
the fundamental domain of k. Owing to the periodicity of e(k ) in fc-space, 
we could equally well have chosen our basic cell to be a cube centred at 
one of the corner points of our present cell. The energy surface (c) would 
then have been approximately spherical, and (a) would have consisted of 
eight separate octants at the corners of the cube. If k' is one of the eight 
vectors defined by 


k\ = k t ± -, k' 2 = k 2 ± -, k 3 = k 3 + -, (8.119) 

a a a 

equation (8.116) becomes 

e(fc') = e 0 -a + 2y(cos k\a+ cos k' 2 a + cos k 3 a). (8.120) 

Thus, if k' is small, that is, in the neighbourhood of any of the corner points 
in Figure 8.17, we have, approximately, 

e(fc') = e 0 — a + 6y—ya 2 k' 2 . (8.121) 


In other words, the energy surfaces are again approximately spheres, as 
for free electrons, and we may define 
the effective mass near a corner point as 

h 2 


jjc 


2ya‘ 


( 8 . 122 ) 


m* is 


Z77A 

/(C 



J)X 


A 


which is equal to — m*, where 
the effective mass near k = 0. 

The simple tight binding method we 
have described is certainly not appli¬ 
cable to the valence or conduction 
electrons in a metal, which are of 
most interest to us, although it is a 
reasonable approximation for the inner 
shells. Also, we have considered in 
detail only the simple cubic lattice. 

None the less, the qualitative features 
of the energy surfaces we have obtained 
are believed to hold for the zones con¬ 
taining the valence electrons of a real metal. That is to say, the energy 
surfaces near the centre of the first Brillouin zone (taking this to represent 
the state of lowest energy of a valence electron in the metal) are very nearly 


Fig. 8.18. Cross-section through energy 
surfaces at equal energy intervals in any 
one of the coordinate planes of /c-space, 
for tightly bound electrons in a simple 
cubic lattice. 
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spheres, but they bulge towards the nearest points of the zone boundary 
as they get larger. Since the surfaces are always spheres for free electrons, 
the amount of bulging clearly depends upon the potential energy function, 
and it must therefore differ even among metals with the same crystal 
structure — this point is not brought out in our tight binding example. 
Owing to the periodicity of e t (k) we have 

£i( fc i) = £ i( k i)- (8.123) 

where k 1 and k 2 are corresponding points on opposite faces of the zone, 
and the inward normal derivative at k 1 equals the outward normal deriva¬ 
tive at k 2 . It follows from this and the symmetry of the energy surfaces 
that generally (but not invariably) the normal derivative of e t (k) is zero 
over the zone boundary — in other words, energy surfaces which intersect 
the zone boundary generally do so orthogonally. 


8.11. The Density of States in Energy 

We may define a density of states in energy jV(e) in the same way as 
for free electrons. That is to say, J r (e)de is defined as the number of orbital 
states with energies lying between e and e + de, where de is a very small 
interval which still contains a large number of energy levels. Thus, 

•/T(e)de = fdfc, (8.124) 

8 n J 


the integral being over the volume of fc-space lying between the surfaces 
of energy e and e + de. Now, grad k e(fc), that is, 

de de de 

e \ ~ -b e 2 ~ -be 3 — , 

ok 1 ok 2 ok 3 


is the derivative of e(k) in a direction normal to the energy surface through 
the point k , and hence the distance between the surfaces of energy e and 
e+de at this point is de/|grad fc e(fc)|. Therefore, 



dS _ 

lgr a d k e(k)\ ’ 


(8.125) 


this being a surface integral over the energy surface e(fc) = e. 

The calculation of the density of states using this formula requires a 
knowledge of the energy surfaces, and, since these are not known accurately 
for any real metal, an exact determination of jV (e) is not yet possible. 
The difficulties are enhanced by the fact that the potential energy V(r) 
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to be used in the Schrodinger equation is itself a source of much speculation. 
None the less, approximate calculations of (e) have been made, which 
suffice for many purposes. Also, some idea of the shapes of «/T(e) curves 
may be obtained from experimental data, such as X-ray spectra and 
electronic specific heats, as discussed in Chapter 7. 

In the tight binding example considered above, the energy near k = 0 
may be written in The jbrm 

e(k) = ~ k 2 , (8.126) 


if the zero of energy is taken to be the lowest energy in the zone. It follows, 
therefore, that the density of states in this neighbourhood is given by the 
same expression as that for free electrons, equation (7.5), with the effective 
mass m* replacing m; thus. 


^(e) = 



(8.127) 


Near the corner points of the cube the energy surfaces are again almost 
spherical, and equation (8.121) shows that here we may write 

£„-£(*) = 21/c' 2 , (8.128) 

2m 


where k' is the vector defined relative to the appropriate corner point by 
equations (8.119), e H is the highest energy in the zone, and m* is the same 
positive effective mass as used in equation (8.127). It follows that, for 
energies near e H 

•'W = z? (S 1(£H “ e) * (8 ' 129) 

The jV(z) curve in this example is therefore parabolic at both ends, and, 
if the whole curve is computed from equations (8.116) and (8.125), it is 
found to have the shape shown in Figure 8.19 (a). The curve is symmetrical 
about the mean value of the energy, but this is a result of the tight binding 
approximation and is not expected to hold in general. In the case of free 
electrons, for example, if we restrict ourselves to a single zone, the energy 
surfaces are spherical and the^C(e) curve is parabolic until the zone boundary 
is reached, after which the energy surfaces consist only of portions of 
spheres lying within the zone and theyT(fi) curve descends, as shown in 
Figure 8.19 (b). For the zones containing the valence electrons of a metal 
the J'Xs) curve will lie between (a) and (b), and hence will have a shape 
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something like that shown in Figure 8.19 (c). The curve is parabolic at 
both ends, although not with the same magnitude of the effective mass, 
and is not symmetrical about the mean energy. The maximum occurs 
roughly where the energy surfaces first touch the zone boundary, and the 
increased slope of the curve just before the maximum is due to the bulging 



(C) 

Fig. 8.19. Density of states for a single zone of an electron 
in a simple cubic lattice: (a) tight binding approximation, 
(b) free-electron approximation, (c) a compromise 
between the two, which is probably the approximate 
shape of the curve for a zone containing the valence 
electrons of a real metal. 


of the energy surfaces, which increases the volume enclosed between two 
surfaces separated by energy de. Although we have only considered the 
simple cubic lattice, the •sV'ie) curves for actual metallic structures must be 
of similar form — there may, however, be a subsidiary maximum if the 
zone is bounded by two sets of planes at different distances from the origin. 
It follows from the discussion of § 8.7 that 
the area enclosed by the e) curve and the 

e axis must equal the number of unit cells 
in the lattice. 

So far we have been discussing a single 
Brillouin zone. There will, of course, be a 
density of states curve for every zone. In the 
case of tightly bound electrons, when the 
energy gaps at the zone boundaries are large, 
as shown in Figure 8.15 (b), the energy bands 
and corresponding ^(e) curves of adjacent zones will be completely separated. 
In the case of electrons which are nearly free, however, so that the energy 



Fig. 8.20. The full curve is the 
total density of states for two 
overlapping zones. 
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gaps are small, as shown in Figure 8.15 (a), the energy bands and jF{f) 
curves will overlap. The total density of states is then obtained by summing 
the contributions from the overlapping zones, and the <A r (e ) curve will not 
fall to zero (see Figure 8.20). 

8.12. Metals, Insulators and Semiconductors 

In the foregoing sections we have been principally concerned with the 
calculation of the energy levels of a single electron moving in a crystal 
lattice. In a solid, of course, there is a large number of electrons, and the 
interaction of the electrons among themselves has an important effect 
upon the motion. As an approximation, however, we may regard each 
electron as moving in the same self-consistent field, so that the same energy 
band system applies to all of them. Now, an orbital state can accommodate 
two electrons, with opposite spins, and hence, at the absolute zero of 
temperature, all the orbital states of lowest energy will be occupied by 
two electrons each. We will denote by e F the highest occupied energy 
level at this temperature, and the surface drawn through all points of k- 
space representing states with this energy is called the Fermi surface t. 

Let us for the moment consider only simple Bravais lattices, such as 
the cubic structures. We have seen in § 8.7 that, in this case, a Brillouin 
zone can accommodate two electrons per atom of the crystal. In general 
there are more than two electrons per atom, and therefore the occupied 
states must extend over several zones. If there is an even number of electrons 
per atom, and the highest occupied zone does not overlap the succeeding 
one, then the electrons must completely fill the occupied zones. On the other 
hand, if there is an odd number of electrons per atom, whether or not the 
zones overlap, the highest occupied zone, at least, must be only partially 
filled. If the two highest occupied zones overlap, of course, then, whatever 
the number of electrons per atom, the highest zone, at least, must remain 
only partially filled. These facts have an important bearing upon the nature 
of solids, as we shall now see. 

Metals are distinguished from other solids principally by the fact that 
they are good conductors of electricity. The application of an electric 
field, however small, to a metal must result in an electric current, that is, 
an overall drift of electrons in one direction. Now, a free electron in the 


t e F is sometimes referred to as the Fermi energy, but we prefer to use this term in 
another connection (see § 7.2), and so will refer to e f as the energy at the Fermi surface. 
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state fc has momentum hk, and its velocity is therefore hk/m. This is not 
true in general for an electron in a periodic field, but we will show in Chapter 
11 that the average velocity <X> of an electron in the state k is always 
given by 

= § rad k e ( fc )* (8.130) 

Owing to the symmetry of s(fc), therefore, the average velocity of an 
electron in the state k is equal in magnitude and opposite in direction 
to that of an electron in the state -k. At the absolute zero of temperature, 
when there is no applied field, all those states are fully occupied which fie 
within the Fermi surface, and, since this is symmetrical about the origin, 
to every electron with a given average velocity there is one with equal 
and opposite average velocity. The total current is therefore zero. 

If a solid is to be a conductor, then, it must be possible for an applied 
field to excite electrons to states of slightly higher energy, in such a way 
that the velocity distribution becomes asymmetrical. This clearly is not 
possible if the occupied region of fc-space is bounded on every side by 
planes of energy discontinuity, that is, if all the occupied zones are com¬ 
pletely filled; for the nearest energy level into which an electron could 
be excited is then separated from the highest occupied level by a finite 
energy gap, and only a very large field could excite an electron over this 
gap (electrons cannot be excited within the filled region of fc-space, of 
course, because all the states there are occupied). Such a solid must therefore 
be an insulator. If the highest occupied zone of a solid is only partially 
filled, however, a very small applied field can excite some of those electrons 
at the Fermi surface into slightly higher energy levels, and such a solid is a 
metallic conductor. The essential difference between an insulator and a 
metal is, therefore, that in the former all the occupied zones are com¬ 
pletely filled, and in the latter at least one zone is only partially filled. 

The tightly bound inner shells of an atom contain an even number of 
electrons. The tight binding method shows that, in a solid, these shells 
will give rise to a number of very narrow energy bands which are com¬ 
pletely filled and separated from the energy bands of the outer shells by a 
large energy gap. These inner shells cannot contribute to the conductivity, 
therefore, and in discussing conduction properties we may ignore them and 
concentrate upon the more loosely bound outer shells, and, in particular, 
the valence electrons. Without loss of generality we may take the origin 
of fc-space, in either the extended or the reduced zone schemes, to correspond 
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to the state of lowest energy of a valence electron in the solid. Also, in 
drawing ^T(e) curves, we will take the zero of energy to be the lowest energy 
of a valence electron — in such diagrams the energy bands of the tightly 
bound inner shells would appear almost as delta functions far to the left 
of the origin. In a metal the band of energy levels occupied by valence 
electrons is generally referred to as the conduction band, although it may 
extend over several zones. 

It follows from the previous discussion that a solid with an odd number 
of valence electrons per atom must be a metal. The fact that there are many 
metals with an even number of valence electrons per atom is due solely 
to the overlapping of the zones containing these electrons. Figure 8.21(a) 
shows the distribution of occupied states, at the absolute zero of temperature, 
in the conduction band of a monovalent metal, such as sodium. The first 
zone is only half filled, and the distribution is very nearly parabolic, as for 
free electrons — the effective mass will generally be different from m, 
however. Figure 8.21 (b) shows the distribution for a divalent metal, such 
as magnesium t. Here, owing to the overlapping of zones, electrons have 
begun to fill up the second zone before the first zone has been completely 
filled. There are thus two unfilled zones, both of which contribute to the 
conductivity. Figure 8.21 (c) shows the ^(e) curve for the highest filled 
zone of an insulator, such as diamond — it is separated by a large energy 
gap from the next zone, which is empty. 

It may be observed in Figure 8.21 (b) that the density of unoccupied 
states is very nearly parabolic. In this case it is generally more convenient 
to describe the conduction properties of this zone in terms of these un¬ 
occupied states. Now, we have seen that the current due to a completely 
filled zone is zero. Hence, the current due to a zone with a single unoc¬ 
cupied state (including spin) must be the negative of the current due to a 
zone which is empty except for a single electron in this state. This means 
that, in describing conduction processes for a nearly filled zone, the electrons 

t It should be noted that metallic magnesium, and indeed most of the divalent metals, 
have the close-packed hexagonal structure, which is not a simple Bravais lattice. The 
first Brillouin zone can accomodate only one electron per atom, and we have seen in 
§ 8.9 that it is not bounded on all sides by planes of energy discontinuity. However, 
there is a zone (consisting of the first and second Brillouin zones) which can accom¬ 
modate two electrons per atom and is bounded on all sides by planes of energy dis¬ 
continuity, and the first >'(e) curve in Fig. 8.21 (b) may be taken to represent this larger 
zone, which we will refer to as the first zone in the present context. The situation is com¬ 
plicated by the fact that planes of energy discontinuity occur within this larger zone, so 
the diagram must be regarded as only schematic. 
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may be ignored and each unoccupied state (including spin) may be treated 
as if it were occupied by a positively charged particle, known as a positive 
hole, the magnitude of the charge being the same as that of an electron. 
Needless to say, this is only a convenient way of viewing the situation, 
the current always being due to the motion of electrons. 

So far we have only considered solids at the absolute zero of temperature. 
In a metal, at higher temperatures, the distribution of occupied states will 
not fall abruptly to zero, as in Figures 8.21 (a) and (b), but, according to 
the Fermi-Dirac statistics, will tail off smoothly, as shown for free electrons 
in Figure 7.4. The effective width of the thermally disturbed region is only 
of the order of kT, however, which has the value 0.026 eV at 300“ K, so 
that except at very high temperatures the distribution is little different 
from that at absolute zero. In the case of insulators, an increase in the 
temperature will result in the excitation of some electrons from the highest 



Fig. 8.21. The shaded regions show the distribution of occupied states, at the absolute 
zero of temperature, for (a) a monovalent metal, (b) a divalent metal, (c) an insulator, 

and (d) an intrinsic semiconductor. 


filled zone to the first empty zone, so that electronic conduction is possible 
in principle. If the energy gap is large, however, as in Figure 8.21 (c), the 
number of excited electrons will be too small to give rise to an appreciable 
conductivity. On the other hand, in the solid whose ^V(e) curve is shown 
in Figure 8.21 (d), which is an insulator at absolute zero, the energy gap 
is so small that at normal temperatures a sufficient number of electrons 
will be excited to give rise to a measurable conductivity. Such solids, examples 
of which are silicon and germanium, are called intrinsic semiconductors. 
The energy gaps in silicon and germanium are 1.2 and 0.8 eV respectively, 
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while that in diamond, for example, is about 6 eV. It should be noted that, 
in the case of semiconductors, the filled energy band containing the valence 
electrons is called the valence band, and the empty band into which electrons 
are thermally excited is called the conduction band, although the holes left 
in the valence band also affect the conductivity. 

A more important source of conduction electrons and holes in semi¬ 
conductors is the presence of impurities. These may give rise to energy 
levels lying inside the energy gap between the valence and conduction 
bands. It is then very much easier for electrons to be excited from these 
levels, if occupied, into the conduction band, or from the valence band 
into these levels, if unoccupied, than for electrons to be excited from the 
valence band into the conduction band. The conductivity at normal tem¬ 
peratures is thus much greater than for purely intrinsic semiconductivity. 
Impurity energy levels from which electrons are thermally excited into the 
conduction band are called donor levels, and they may arise when the 
dissolved impurity atoms have a higher valency than the solvent — for 
example, phosphorus (valency 5) in silicon (valency 4). A semiconductor 
which contains such impurities is called an n -type semiconductor, the n 
standing for the negative charge of the excited electrons which are the 
current carriers. On the other hand, impurity energy levels to which electrons 
are thermally excited from the valence band are called acceptor levels, 
and may arise when the dissolved impurity atoms have a lower valency 
than the solvent — for example, boron (valency 3) in silicon. A semi¬ 
conductor which contains such impurities is called a p -type semiconductor, 
the p standing for the positive charge of the holes in the valence band 
which are here the current carriers. 

Finally, we must briefly mention a class of metals whose energy band 
structures are slightly more complicated than those we have considered 
above, owing to the presence, in the free atoms, of electrons in d states 
whose energies do not differ greatly from those of the valence electrons. 
These are the transition metals and those which immediately follow them 
in the Periodic Table. As a specific example, we will consider the technolo¬ 
gically important iron group of transition elements, scandium, titanium, 
vanadium, chromium, manganese, iron, cobalt, and nickel, all of which have 
incomplete 3d sub-shells (see § 4.3) in addition to 4s electrons, and at 
the same time we may consider copper, in which the 3d sub-shell is complete 
and there is one 4s electron. The numbers of 3d and 4s electrons in each 
of these elements are shown in the following table: 
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Sc 

Ti 

V 

Cr 

Mn 

Fe 

Co 

Ni 

Cu 

Number of 3d 

electrons 

1 

2 

3 

5 

5 

6 

7 

8 

10 

Number of 4s 

electrons 

2 

2 

2 

1 

2 

2 

2 

2 

1 


Further groups of transition elements are obtained when the 4d and 5d 
sub-shells are filling up, but we will not consider these in detail. 

Now, we have mentioned in connection with the tight binding method 
that for every state in a free atom there is a Brillouin zone in the metal. 
It is important to realize, however, that a particular zone can only be 
regarded rigorously as deriving from a particular atomic state (in the 
sense of the tight binding method) if its energy band is narrow and separated 
from those of neighbouring zones by large energy gaps. If several zones 
overlap, they must all be regarded as deriving from the same group of 
atomic states. None the less, it is possible in principle to establish a cor¬ 
respondence between the states of a free atom and the zones in a metal, 
which is conceptually useful. If we consider, not the solid metal under 
normal conditions of temperature and pressure, but a dispersed gas of free 
atoms, it is clear that all the occupied states of the free atoms will be tightly 
bound, according to our definition, for these atoms will be so far apart 
that not even the wave functions of the valence electrons will overlap 
appreciably. We may, for convenience, imagine these atoms to be arranged 
on a crystal lattice of the same structure as that of the solid metal, but very 
much larger, and apply the Bloch method to this expanded lattice. It is 
clear that the tight binding method (with provision made for degeneracy) 
will apply to all the occupied zones, and their energy bands will be extremely 
narrow — in fact, hardly to be distinguished from the energy levels of a 
single free atom. If tins lattice is now contracted, the atomic wave functions 
will begin to overlap and the energy bands corresponding to the various 
atomic levels will begin to widen. They will not all widen at the same rate, 
however — the bands corresponding to the valence electron states will 
widen most rapidly, because the valence electron wave functions are most 
widespread and hence overlap more at large interatomic distances. The 
bands corresponding to the tightly bound inner shells, however, will hardly 
widen at all until the interatomic distance is reduced below that of the 
normal solid metal. In the case of the transition metals considered above 
the wave functions of the 3d atomic states are neither as widespread as those 
of the valence electrons nor as constricted as those of the inner shells, 
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and the corresponding band will have an appreciable width at the inter¬ 
atomic distance of the normal metal, but this width will be considerably 
less than that of the valence electron band. 

The way in which the various bands are thought to widen in the case 
of copper is shown schematically in Figure 
8.22, and, apart from the relative positions 
of the 3d and 4s atomic levels, the picture 
is believed to be very similar for the 
transition metals. We may call the band 
corresponding to the atomic valence 
electron level the 4s band and that cor¬ 
responding to the 3d atomic level the 
3d band, although again we must em¬ 
phasize that the electronic states in these 
bands are not obtained simply by linear 
combination of atomic 4s and 3d states 
respectively. As shown in the diagram, 
the band corresponding to the normally 
unoccupied 4p atomic level overlaps the 
4s band at the interatomic distance of 
the solid metal, so that the 4p states must 
strongly affect the wave functions of the valence electrons in the metal. . 

Many of the properties of the transition metals may be explained on 
the assumption, borne out by some detailed calculation, that the 3d band 
in the normal metal is very narrow and overlaps the 4s band. Since the 3d 
atomic state is five-fold degenerate, the 3d band, in spite of its narrow width, 
must cover five Brillouin zones, so that the density of states in the 3d band 
must be very great compared with that in the 4s band. Thus, the 
curve for the 3d band is very tall and narrow, while that of the 4s band 
is flat and wide, as shown schematically in Figure 8.23. Now, ten electrons' 
per atom are required to fill the 3d band completely, and nickel is the only 
transition metal with ten 3d and 4s electrons per atom. However, even in 
nickel, owing to the overlapping of the 3d and 4s bands, some electrons 
must go into the 4s band, and the 3d band is not completely filled, as shown 
in Figure 8.23 (a). Hence, all the transition metals of the iron group have 
incompletely filled 3d bands. In copper, on the other hand, which has eleven 
3d and 4s electrons per atom, there is evidence that the 3d band is com¬ 
pletely filled, and the 4s band therefore contains one electron per atom, 



Fig. 8.22. The dependence of energy 
band widths upon>interatomic distance 
in copper (schematic). The vertical 
broken line is drawn at roughly the 
interatomic distance of the normal 
solid metal. 
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as shown in Figure 8.23 (b) — detailed calculations indicate that the 
width of the 3d band is about half the width of the occupied part of the 
4s band, and that the energy at the Fermi surface is 3 or 4 eV above the 
highest energy in the 3d band. 

The density of states at the Fermi surface is very high in the transition 
metals, owing to the incompletely filled 3d band, and this explains the high 
electronic specific heats of these metals 
— the low temperature electronic specific 
heat of nickel, for example, is ten 
times that of copper. The incomplete 
filling of the 3d band also provides an 
explanation of the ferromagnetism of 
iron, cobalt, and nickel. The ferro¬ 
magnetic state occurs when, in the 
absence of an external magnetic field, 
there are more electrons with one kind 
of spin than there are with the other. 

Hence, if the ferromagnetism is due to 
the 3d electrons, as is believed, it can 
only occur'wJ^en the 3d band is in¬ 
completely filled — if the 3d band is 
completely filled, as in copper, there 
must be equal numbers of 3d electrons with both kinds of spin. Although 
this explains how iron, cobalt, and nickel can be ferromagnetic, while copper 
is not, it does not explain why ferromagnetism occurs in these three metals 
only and not in the other transition metals. The explanation of this depends 
upon an analysis of the terms contributing to the cohesive energy of a metal 
(in particular, the ‘exchange’ term) which is still largely speculative, and 
so we will omit it. 




Fig. 8.23. Density of states (schematic) 
in the 3d and 4s bands of (a) nickel 
and (b) copper. The shaded regions 
show the distribution of occupied sta¬ 
tes at the absolute zero of temperature. 








CHAPTER 9 


THE METHOD OF WIGNER AND SEITZ 


9.1. The Cellular Method 

We will now consider a more direct method of calculating the wave 
functions of the electrons in a metal, which has had much success. This 
was first applied by Wigner and Seitz [1933, 1934] to metallic sodium, and, 
in its more general application, is known as the cellular method. 



Fig. 9.1. Atomic polyhedra for (a) the body-centred 
cubic lattice, (b) the face-centred cubic lattice, and 
(c) the close-packed hexagonal lattice. 


Owing to the periodicity of the function u k (r) appearing in the Bloch 
function (8.74), it is only necessary to solve the Schrodinger equation 
(8.71) within a single unit cell of the crystal lattice. In order to obtain 
the simplest boundary conditions, we choose the unit cell to be a polyhedron 
with the greatest possible symmetry about an atomic position. Thus, we 
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draw lines connecting an atom to its nearest and next nearest neighbours 
and construct the planes which bisect these lines perpendicularly. We then 
choose the unit cell to be the smallest polyhedron surrounding the central 
atom and bounded by these planes. It is variously called the Wigner-Seitz cell , 
atomic polyhedron, cellular polyhedron, or, in older works, the s-polyhedron. 
The atomic polyhedra for the body-centred cubic, face-centred cubic, 
and close-packed hexagonal lattices are shown in Figure 9.1. 

It is apparent that the method of constructing the atomic polyhedron 
is the same as that employed in § 8.7 to construct the fundamental domain 
of k, so that the latter is just the Wigner-Seitz cell of the reciprocal lattice. 
Since we have seen that the body-centred cubic lattice has a face-centred 
cubic reciprocal lattice, and vice versa, it follows that the atomic polyhedron 
for either of these lattices must have the same shape as the fundamental 
domain of k for the other (compare Figures 8.16 and 9.1). The atomic 
polyhedra completely fill the space of the lattice, and, in the case of the 
cubic structures (or other simple Bravais lattices), any one may be brought 
into coincidence with any other by a translation through a lattice vector 
R n , given by equation (8.65). This is not true in the case of the close-packed 
hexagonal lattice, for which a rotation as well as a translation through a 
lattice vector may be necessary to bring two atomic polyhedra into coin¬ 
cidence — this causes a slight complication in the application of the cellular 
method to the close-packed hexagonal lattice, and in the following work 
we will consider only the cubic structures. 

Let us, then, determine the boundary conditions which a Bloch function 
must satisfy at the surface of an atomic polyhedron 
in one of the cubic lattices. We note first that to 
each face of the cell there is a parallel face, and the 
vector going perpendicularly from one to the 
other is a translational vector of the lattice. Suppose 
that O is the centre of the cell (Figure 9.2), and A 
and B are the centre points of a pair of opposite 
faces. The vector AB is thus a lattice vector, which Fig- 9.2. 

we will denote by R. Also, let P and Q be cor¬ 
responding points on the A and B faces, respectively, so that AP = BQ, 
and, if OP = r, then OQ = r + R. 

Now, any Bloch function must satisfy the condition 



^k(r + R) = e ik ‘ R iA fc (r). 


(9.1) 
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Hence, if eR is a small displacement in the direction of R, we have 

il/ k (r + R + sR) = e ikR il/ k (r + eR). 

Expanding both sides of this equation in Taylor’s series (see Appendix 1), 
we obtain 

il/ k (r + R) + eR • Vi j/ k (r + R)+ . . . = e ifcfi |> fc (r) + £R • Vi j/ k (r)+ . . .], (9.2) 

in which all the terms not explicitly given contain, as factors, powers of £ 
higher than the first. Owing to (9.1) the first term can be dropped on 
each side of (9.2). Dividing through by e, and then taking the limit as e 
tends to zero, gives, finally, 

R-ViP k (r + R) = e' kR R ■ W\j/ k (r). (9.3) 

Every wave function must satisfy conditions (9.1) and (9.3) for each pair 
of faces of the atomic polyhedron. 

The boundary conditions may be further modified according to the 
symmetry of \f/ k . It is possible to determine the latter in general using the 
theory of groups, but as this lies outside the scope of the present work we 
will confine ourselves to simple cases where the symmetry is obvious. 
We shall be particularly concerned with the state of lowest energy of a 
valence electron in the alkali metals (Li, Na, K, Rb, Cs), which are mono¬ 
valent and have the body-centred cubic structure. Since the valence electrons 
(that is, electrons whose energies lie in the conduction band) are nearly 
free, we may assume that the state of lowest energy is at k = 0, and we 

denote the wave function by i j/ 0 {r) [or u 0 (r)’ — it has the periodicity of 

the lattice, for the factor exp (ifc • r) is unity when k = 0]. Now, the valence 
electron in a free atom of one of these metals is in an s state, and we expect 
t^o to resemble an s function within an atomic polyhedron — in particular, 
we do not expect it to have any nodal planes. This symmetry type is sometimes 
denoted by the symbol F s — the f referring to the centre of the zone, 

and the subscript s to the fact that the function has no nodal planes. Of 

course, i J/ 0 will not be exactly spherically symmetrical within an atomic 
polyhedron, but at least it will have the full symmetry of the lattice, which 
is quite high, as Figure 9.1 (a) shows. From the symmetry of the function, 
then, it follows that t J/ 0 and its outward normal derivative must have the 
same values at similarly situated points on opposite faces of the cell. That 
is to say, using the notation of Figure 9.2, 
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\lf 0 (r + R) = t/^oO), (9.4) 

R- V^o(r + «) = ~R‘ V^ 0 (0- ( 9 - 5 ) 

The boundary condition (9.1), with k = 0 simply repeats (9.4), but the 
boundary condition (9.3) tells us that 

R-Vil/ 0 (r + R) = R-Vil/ 0 (r). (9.6) 

Comparing (9.5) and (9.6), we see that R ■ Vi l/ 0 (r) must be zero everywhere 
on the surface of the cell. In other words, the normal derivative of ifoW 
must vanish at the surface of the atomic polyhedron. 

The foregoing result may have been obtained more simply. Since 
has the period of the lattice, and is also symmetrical about any nucleus, 
it is necessary for the normal derivative to vanish at the surface of a cell, 
for otherwise the gradient would be discontinuous there. However, this 
applies only to the state of lowest energy, whereas the boundary con¬ 
ditions (9.1) and (9.3) apply to all wave functions. 


9.2. The State of Lowest Energy of a Valence Electron in a Metal: The 
Atomic Sphere Approximation 

To be specific, we will now consider metallic sodium, which was the 
first metal to be treated by the Wigner-Seitz method, and remains the metal 
most ideally suited to this treatment. It should be noted, however, that the 
method has been applied, with great success, to all the alkali metals, and, 
with reasonable success, to other metals. 

Owing to the high symmetry of the atomic polyhedron [Figure 9.1 (a)], 
Wigner and Seitz, for the purpose of calculating i j/ 0 {r), replace the cell 
by a sphere of equal volume. The radius of this atomic sphere is r s , the 
atomic radius, which we have previously defined by 


471 

3~ 



(9.7) 


where v is the volume of the metal and N the number of atoms. 

We may assume as a good approximation that the wave function i p 0 
is spherically symmetrical within the atomic sphere, and that its normal 
derivative vanishes at the surface of the sphere, that is, 



r = r s 


(9.8) 
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It is found, in fact, that 1 1/ 0 is almost constant over a large interval about 
r = r s , so that the function which satisfies the above boundary condition 
very nearly satisfies the true boundary condition (the vanishing of the 
normal derivative at the surface of the cell). 

In order to calculate iJ/ 0 we must first know the potential energy V(r) 
of a valence electron in the crystal, which is sometimes called the crystal 
potential, and this requires further approximations. First, as we have noted 
in § 6.1.1, the dimensions of the ion-core of a sodium atom are small com¬ 
pared with those of the atomic sphere, so that there is practically no over¬ 
lapping of the core electron wave functions of different ions in the metal. 
Consequently, we may assume that the field of an ion-core in the metal is 
the same spherically-symmetrical field as in a free atom. However, the 
field of the ion-cores is not the only field we must take into account: there 
is also that of the other valence electrons. 

The total valence electron charge density is periodic with the period of 
the lattice, since 

IMr)l 2 = te0)| 2 , (9.9) 

so that each cell is electrically neutral, and, as the cells are very nearly 
spherical, it is a good approximation to take the electrostatic field of each 
cell to be zero at points outside the cell. Thus, the field acting on any 
electron + is essentially that of the ion-core in the cell in which the electron 
is located plus the self-consistent field of the other electrons in that cell. 
However, we have seen in Chapters 6 and 7 that, as a result of the Pauli 
principle, an electron is always surrounded by a hole of approximate 
radius r s in the charge distribution of electrons with parallel spins — this 
is known as the Fermi hole. We have suggested that, as a result of Coulomb 
correlations, a similar hole must also exist in the distribution of electrons 
with antiparallel spins. The effects of Coulomb correlations will be in¬ 
vestigated in the following chapter, but meanwhile we will accept the 
existence of a region, roughly of radius r s , surrounding an electron during 
its motion, in which it is unlikely that any other electron, regardless of spin, 
will be found. Hence, the presence of any electron in a cell virtually excludes 
all the other electrons from the cell, and we may assume that the field 
acting on the electron is that of the ion-core in the cell only. This cannot 


t In future, when we refer simply to an electron , we shall mean a valence electron , 
unless otherwise stated. 
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be strictly correct, of course, since the hole follows the electron during its 
motion, and does not remain centrally placed in the cell, but in view of 
other approximations which have to be made this is of no importance — 
it can be shown, in fact, that if the hole is ignored, and the Hartree field 
of the valence electrons included in V(r), the wave function \j/ 0 is not 
significantly changed. 

The wave function i J/ 0 and energy e 0 of the lowest state may therefore 
be found by integrating the Schrodinger equation 


dVo + 2 dl Ao 

dr 2 r dr 


2m 


+ [ £ o~ ^ / ( r )]*Ao — 


(9.10) 


within an atomic sphere, subject to the boundary condition (9.8). V(r) 
is the same spherically symmetrical ion-core potential function as in a free 
atom, and might be obtained, for example, by a Hartree-Fock calculation 
for a free atom. However, the function V(r) used by Wigner and Seitz 
for sodium was a semi-empirical function constructed by Prokofiev, 
which was designed to reproduce with reasonable accuracy the spectrum 
arising from transitions of the valence electron in a free atom. In recent 
years several methods, based upon that of Wigner and Seitz, have been 
developed for finding e 0 directly from the observed atomic energy levels 
(obtained from spectral data) without explicitly calculating V(r) and \}/ 0 (r). 
We will briefly describe the most accurate of these, called the Quantum 
Defect Method, in a later section. 

In practical calculation it is convenient to use the atomic units defined 
in § 3.3, and also to let f(r) = n/^ 0 (r), when equation (9.10) reduces to the 
simple form 

(9.11) 

dr 

and the boundary condition (9.8) becomes 

— = - when r = r s . (9-12) 

dr r 

There is another condition, which we have not mentioned explicitly. Since 
the valence electron in a free atom of sodium is in a 3s state, its wave function 
has two spherical nodes, and calculation shows that these have radii very 
much less than r s — in fact, they lie within the effective radius of the ion- 
core. We therefore require i j/ 0 and / to have two spherical nodes within 
the atomic sphere. This condition is not of great practical importance, 
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for, although equation (9.11) has solutions with any number of nodes, these 
correspond to energies lying far from the range in which we are interested, 
and so would not normally be encountered. 

We wish to use £ 0 in calculating the cohesive energy and equilibrium 
atomic radius of the metal, and, for this 
purpose, we are not so much concerned with 
calculating ip 0 and e 0 for one particular value 
of r s as with obtaining £ 0 as a function of r s . 

The potential energy V(r) is, of course, a tabu¬ 
lated function and the integration of the Schro- 
dinger equation must be carried out numerically, 
but we will not give details of the numerical meth¬ 
ods used. The procedure is to choose a reasonable 
value for £ 0 (that is, one which lies an electron- 
volt or so below the energy of the valence 
electron in a free atom), integrate equation 
(9.11) outwards from the origin, where / is 
zero, and then find the points lying beyond the 
second node of \J/ 0 at which the boundary 
condition (9.12) is satisfied. The result for f(r) and ij/ 0 (r) will, if £ 0 has not 
been chosen too low, be as shown schematically in Figure 9.3. The boundary 
condition is satisfied for two values a and b of r after the second node 
of \l/ 0 . Hence, there are two possible values of r s corresponding to the 
chosen e 0 . 

If the value of e 0 is now lowered slightly (£ 0 is negative so that this 
means a slight increase in its magnitude), and the procedure repeated, 
the two possible values of r s will be found to he closer together. Further 
repetition will finally result in the boundary condition being satisfied at a 
single inflection point of i/'o only, and for a still lower value of £ 0 the bound¬ 
ary condition will not be satisfied at all beyond the second node of \f/ 0 . 
On the other hand, if £ 0 is raised from its original chosen value, the two 
possible values of r s will diverge, until one of them becomes infinite for 
an £ 0 equal to the negative of the ionization energy of a free atom — this 
result is easily comprehended, for V(r) is the ion-core potential function in 
a free atom and the valence electron wave function in a free atom tends 
to zero as r tends to infinity. In this way we obtain £ 0 as a function of 
r s , and its graph is shown in Figure 9.4. 

The wave function i J/ 0 , corresponding to the observed value of r s (3.96 



Fig. 9.3. Graphs of /(r) and 
y> 0 {r) obtained by integrating 
equation (9.11) with a chosen 
value of e 0 . The boundary 
condition (9.8) or (9.12) is 
satisfied at r = a and r = b. 
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B.u.), for metallic sodium is shown in Figure 9.5 (a). The function is almost 
constant in the region of the atomic sphere lying outside the ion-core, 
and the volume of this region is roughly 90 per cent of the total volume. 



Fig. 9.4. Graphs of £ 0 as a function 
of r s for sodium, the free-electron 
Fermi energy Ep, and £ 0 + £p. I is 
the ionization energy of a free atom, 
and S and r 0 are approximations 
to the cohesive energy and equilib¬ 
rium atomic radius, respectively. 


Fig. 9.5. Wave function y> 0 of the lowest state 
of a valence electron in (a) metallic sodium 
(r s = 3.96 B.u.), (b) metallic magnesium 

Os = 3.33 B.u.). 


If \j /q were absolutely constant throughout the whole volume, the electrons 
would be free, so we may deduce that the valence electrons in metallic 
sodium are, in fact, very nearly free. This does not apply only to sodium, 
however. In all metals to which the Wigner-Seitz method has been applied, 
i/o is found to be almost constant in that part of the atomic sphere lying 
outside the ion-core. For example, Figure 9.5 (b) shows i j/ 0 for the divalent 
metal magnesium, obtained in the same way as for sodium. 


9.3. Occupied States of Higher Energy: The Effective Mass 

It must be remembered that s 0 is just the energy of the lowest state of a 
valence electron in the metal (or, more precisely, the lowest eigenvalue of 
the Schrodinger equation for a valence electron in the metal). According 
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to the Pauli principle, only two electrons, with opposite spins, may occupy 
any orbital state, so that all but two electrons must be in higher energy 
levels. In order to calculate the cohesive energy we shall need to know 
the energies of all the occupied states. 

The Schrodinger equation for any state iJ/ k (r) is 


V 2 «AftO)+ -j [e(fc)-F(r)]^ k (r) = 0, 
n 


(9.13) 


where, for the reasons given in the previous section, V(r ) is assumed to be 
the potential energy of an electron due to the ion-core in the cell where the 
electron is located only. This equation holds throughout the crystal, but 
it is only necessary to solve it within a single cell, the boundary conditions 
to be satisfied at the surface of the cell being (9.1) and (9.3). These boundary 
conditions cannot be simplified for a state k in general as they can for 
k = 0, so that the atomic sphere approximation is of no immediate help. 
The mathematical difficulties are such that a direct solution of the equation 
is not practicable and approximate solutions must be sought. 

Wigner and Seitz [1933], in their first paper on metallic sodium, made 
the approximation 


•AfcO) = e ik r u 0 (r) 


(9.14) 


where u 0 (r) [= t j/ 0 (r)] is the function calculated in the previous section 
continued periodically throughout the crystal. The function (9.14) is of the 
Bloch type, since u 0 (r) has the period of the lattice, but the approximation 
lies in the use of u 0 (r ) instead of a function u k (r) which varies with k. However, 
there are several reasons for believing that the approximation is a reasonable 
one. First, if u 0 (r ) were constant, the electrons would be free, so that (9.14) 
would be perfectly correct; and we have seen that u 0 (r) is, in fact, nearly 
constant throughout most of the volume of the crystal. Second, the ex¬ 
pression (9.14) reduces correctly to u 0 (r) when k = 0. Finally, since only 
the lower half of the first Brillouin zone is occupied in the alkali metals, 
we might expect that for the occupied states the true u k (r ) will not differ 
much from u 0 (r). 

If we assume that u 0 (r) is normalized in the volume of the crystal, and 
remembering that it is a real function, we obtain, from equations (9.13) 
and (9.14), the approximate energy of the state t p k (r) as 



2m 


e(k) = 


Q ikr u 0 (r)dr, (9.15) 
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the integral being over the volume of the crystal. Now, 
e' k ' r u 0 (r) = e i(kiX+kzy+k3Z) u 0 (x, y, z), 

so that 

— [e' k ' r u 0 (r)] = e' k r — +ik 1 e ,k r u 0 , 
dx dx 

and 

Differentiating with respect to y and z in the same way, we have, finally, 
V 2 [e ift r n 0 (»] = e ifc ' r (V 2 +2i k ■V-k 2 )u 0 (r ). (9.16) 

Hence 

h 2 k 2 


K k ) = J M o( r ) 


h 2 

2m 


V 2 + V(r) 


u 0 (»‘) d, ‘ + 


m 


2m 


J u 0 (r)Vu 0 (r)dr . (9.17) 


Since u Q (r) is an even function, that is, u 0 (r) = u 0 ( — r), and Vu 0 (r) = 
— Vu 0 (—r), the integral in the last term is zero. Also, the equation for 
u 0 (r) may be written 


h 2 

- V 2 + V(r) 

L 2m 


I'oO) = 8o“d(0- 


(9.18) 


We therefore have 


s(k) = e 0 + 


rSk 2 

2m 


(9.19) 


which is the energy of the lowest state plus the kinetic energy of a free 
electron in the state k. 

For the true wave function of the state k, 

P-20) 

we have, by the same method as used in obtaining equation (9.16), 

VV fc (r) = e ik ' r (V 2 + 2ik • V-k 2 )u k (r). (9.21) 
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Substituting (9.20) in (9.13), therefore, gives the equation 



h 2 L 2 m 


for u k (r). Now, this equation reduces to that of u 0 , equation (9.18), when 
s(k) is given by (9.19) and the term 2i k ■ Vw k is neglected, so that the ap¬ 
proximation (9.14) is a reasonable one so long as this term is very small. 
Whatever the magnitude of Vw k , this term will certainly be small for states 
near k = 0. Hence, for such states, at least, we may obtain a better ap¬ 
proximation to the energy than (9.19) by treating 2ik • Vw k as a small 
perturbation. This was done by Wigner and Seitz [1934] in their second 
paper on sodium, but we will describe a rather more elegant method due 
originally to Bardeen [1938], which gives the final result in a compact 
analytical form. We wish to remark, however, that the derivation of the 
result is not essential to an understanding of the remainder of the book, 
and may be omitted at a first reading. 

Let us, on the assumption that k is small, expand u k in powers of k 
as far as the first, thus 


(9.23) 


u k (r) = Uo(r) + ku!(r), 


where w 0 (r) and u^r) both have the period of the lattice. In the remainder 
of this section it will be convenient to use the atomic units defined in § 3.3, 
so that equation (9.22) becomes 


V 2 u k + 2i k • Vw k + [c(k) - k 2 - V(r)]u k = 0, (9.24) 


and 


[ n*[-V 2 —2i/c ■ V + /c 2 + L(r)> k dr 
J K\ 2 dr 


(9.25) 


e(k) = 


where, owing to the periodicity of u k , the integrals may be taken over 
the whole volume of the crystal or over a single cell. Substitution of (9.23) 
in (9.25), which we will do in detail later, shows that it gives e(k) correct 
to terms of the second orfler in k as 


s(k) = e 0 + a ^ 2 > 


(9.26) 


since the first order term, 
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vanishes owing to the symmetry of u 0 . The constant a is to be determined. 

From equations (9.23), (9.24), and (9.26) we obtain, correct to the first 
order in k, 

V 2 (u 0 + &u 1 ) + 2ifc • V(w 0 + £mi) 

+ [e 0 + ak 2 -k 2 -V(r)](u 0 + ku 1 ) = 0. (9.27) 

Since we are neglecting any dependence of s(k) on the direction of k, 
we can, without loss of generality, choose k to be in the z-direction, 
so that 

k • V(u 0 + ku t ) = k — (u 0 + ku (9.28) 

dz 

Then, equating to zero separately the terms of (9.27) not containing k 
and those containing the first power of k only (this is possible, because, 
when a solution in the form of a series is substituted in a differential equation, 
the result is an identity), we obtain 

[V 2 + e 0 — V(r)]u 0 = 0, (9.28) 

IV + £o —FWK = -2i^2. (9.29) 

dz 

The first equation merely confirms that u 0 = t j/ 0 , as previously defined, 
and the second must be solved subject to appropriate boundary conditions 
for u v 

A particular solution of equation (9.29) is — izu 0 , as we will verify; thus, 
V 2 ( —izu 0 ) = — iu 0 V 2 z — 2iVz • V« 0 —izV 2 u 0 

= 0 — 2i —izV 2 u 0 , 
dz 

so that 

[V 2 + e 0 — F(r)]( — izu 0 ) = -2i~ -iz[V 2 + e 0 - V(r)]u 0 

dz 

= -2i^2, 
dz 

using equation (9.28). The general solution of equation (9.29) is obtained 
by adding to this particular solution the complementary function, which is 
the general solution of the homogeneous equation obtained by neglecting 
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the right hand side of (9.29). However, we do not require the general 
solution, but a solution which satisfies certain boundary conditions, which 
we will now discuss. 

We know that u 0 is an even function, so that dujdz is an odd function, 
and hence, in order to satisfy equation (9.29), iq must be an odd function; 
that is to say, Wj(r) = — u x { — r). It is clear that an odd function which has 
the period of the lattice must change sign at the surface of an atomic 
polyhedron, and so, if it is to be continuous, its value must be zero there. 
Thus, the boundary condition on is that it must vanish at the surface of 
an atomic polyhedron, or, using the atomic sphere approximation. 


(9.30) 


«iW = 0 . 


To the particular solution — izw 0 of equation (9.29) already found we 
must therefore add a solution of the homogeneous equation 


[V 2 + e 0 -V(r)]u= 0 


(9.31) 


which will ensure that boundary condition (9.30) is satisfied — in other 
words, we require 


(9.32) 


u = iz« 0 at r = r s . 


It follows that u must have p symmetry (see § 2.4) within an atomic sphere, 
that is, it must be z times a spherically symmetrical function. Let us write 



(9.33) 


then 





(9.34) 


and equation (9.31) becomes 



(9.35) 
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The conditions on P(r) are that it be zero at the origin and scaled 
so that 

P(r s ) = r s 2 WoO's)- ( 9 - 36 ) 

It should be noted that we are not required to find the eigenvalue of equation 
(9.35), since e 0 is known. Having found P(r ) by integrating (9.35) outwards 
from the origin, and multiplying it by a suitable constant to satisfy (9.36), 
we then have 

u^r) = iz -u 0 ^ , (9.37) 


which- satisfies (9.30). This expression is only valid, of course, within an 
atomic sphere [where, incidentally, V(r) is spherically symmetrical, and might 
be written V(r )] or cell — the function must be continued periodically 
through the crystal. We note that u 0 is a real function while u 1 is pure 
imaginary. 

From equations (9.23) and (9.25), we have 


e(k) = 


o 

+ 

* 

— V 2 + F(r) + k 2 — 2ik — 

(u 0 + kui)dr 

J Q 





u 0 + ku^ 2 dr 


(9.38) 


where the integrals are taken, as a good approximation, throughout an 
atomic sphere Q. Using equations (9.28) and (9.29), this becomes 


s(ft) — Sq + k 2 — 


2i k 2 f (wo + ZcMi)* —^ di¬ 

do dz 

J “» 


dr + k 2 


|»i| 2 dr 


— Sq + k 2 + k 2 



(9.39) 


as far as terms in k 2 . 

Let us assume that u 0 is normalized to unity within an atomic sphere, 
so that 



1 . 


(9.40) 
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Then, as we have previously remarked, 

e(k ) = e 0 +ak 2 , 


where 


a = 1 — 2i 



du^ 

dz 


dr. 


Writing u t = i zg(r), where 

g(r) = 4 ~u 0 , 

r 


(9.41) 


we obtain 

a = 1+2 

i 

= 1+2 


n 

* r s 


«0 \Q+—g'\dr 


Anr 2 u 0 gdr + An r 3 dr 


= 1 + 871 


871 


r 

J 0 

r 

j 0 


./_ „2 


u 0 g' cos 9 sin Odd 


r 2 u 0 gdr+ ^ | r 3 u 0 g'dr 


r, 871 r s d , , . , 

= 1 + 871 r 2 u 0 gdr - g — (r u 0 )dr 

J 0 3 J 0 dr 


= 1 - 


871 


r gu' 0 dr 


0 

** s 


= 1- — f r(P — r 2 u 0 )u' 0 dr. 

3 J 0 


v 9.42) 


We have performed an integration by parts in the fourth line and used 
the fact that g(r s ) = 0. 

In terms of the function/(r) = ru 0 , (9.42) becomes 


a = 1 — 


871 




(9.43) 


Now, 


rff dr = [r/ 2 ]o - (f + rjf')dr. 
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and, from (9.40), 



4n | f 2 dr = 4n \ r 2 uldr = 1, 

Jo Jo 


so that 

- fV/'dr = ^ [r/ 2 ]o — y [f 2dr 

3 J 0 3 3 J 0 



4n 3r , .-|2 1 

= — r s KOs)] - - > 

(9.44) 

and 


(9.45) 


Also, 

= [Wo" r~(rP' + P)dr, 

J 0 r 

and, hence, 

2pP (/'- ^ dr = [PfYo - (fP' - Pf + ^y)- dr -. ( 9 - 46 ) 

From equations (9.11) and (9.35), for / and P, respectively, we have 


which gives 


4- WP'-Pf ).] = fP’-Pf'+ — • (9.47) 

dr r 


From (9.45), (9.46), and (9.47), we obtain 


An 


An 


An 


« = J rfroM] 2 - J P(rif(r.) + j r.[/(r ! )P'(r.)-P(r.)/'(r ! )]. (9.48) 
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However, boundary condition (9.12) for / tells us that 

/'W = = w o(r s ), 

and equation (9.36) gives 

P(r s ) = r 2 u 0 (r s ) = rj(r s ). 

Hence, 

r s P(r s )f'(r s ) = P(r s )/(r s ) = r s 3 [u 0 (r s )] 2 , 

and 

12 -P'W 


r s f(r s )P'(r s ) = OoO's)]' 




Finally, substituting (9.49) and (9.50) in (9.48), we find 

> dP 


a = y 


LP dr 


-1 


- 1 r = r s 


where 


r = j U["o^J] 2 - 


(9.49) 


(9.50) 


(9.51) 


(9.52) 


Remembering that u 0 is normalized to unity within the atomic sphere, 
we see that y is the ratio of the value of Uq at the surface of the sphere 
to its mean value (3/4nr^). 

Once u 0 and e 0 have been found for a given value of r s , all that is required 
to determine a for this r s is a single integration of equation (9.35). The 
values of a obtained by Bardeen for sodium and lithium were 1.07 and 
0.65, respectively, for the observed atomic radii. More recently, Brooks 
[1953] has used formula (9.51) in conjunction with the Quantum Defect 
Method (see § 9.7) to calculate a for all the alkali metals, with the following 
results: 



Li 

Na 

K 

Rb 

Cs 

a 

0.69 

1.02 

1.08 

1.12 

1.20 


Reverting to ordinary units (a is, of course, a dimensionless constant), 
the energy e(fc), for small k, becomes 
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s(k) = s 0 + a- 

2m 


(9.53) 


and we may write 


a = m/m*, 


(9.54) 


where m* is the effective mass near k = 0. We see that for sodium, in 
particular, the effective mass is little different from m, again indicating 
that the valence electrons are very nearly free. Since the first Brillouin 
zone is only half filled in the alkali metals, it is probable that the energy 
surfaces in the occupied region do not differ greatly from spheres. Hence, 
although a has been calculated for small values of k only, we might expect 
equation (9.53) to hold approximately for all the occupied states. This will 
be used in the following section. 

9.4. The Cohesive Energy of Metallic Sodium 

In their first paper on metallic sodium, Wigner and Seitz [1933] made 
use of the approximate formula (9.19) for e(k). Assuming that the effects 
of electronic interactions are adequately taken care of by the exclusion 
of all but one electron from an atomic polyhedron, they then postulated 
that the average energy of a valence electron in the metal is simply e 0 + £ F , 
where E r is the free-electron Fermi energy, given in atomic units by 


2.21 

E f = — ryd 


(9.55) 


r, 


[see equation (7.52)]. The graph of e 0 + E F as a function of r s is shown in 
Figure 9.4. According to this theory the value of r s corresponding to the 
minimum of e 0 + E r is the equilibrium atomic radius, which we will denote 
by an d the depth of the minimum below the value of e 0 for infinite 
r s is the theoretical cohesive energy (per atom), which we will denote by S. 
Thus, if / is the (positive) ionization energy of a free atom, we have 



(9.56) 


This procedure is undoubtedly crude, but none the less it cannot be far 
from the truth, for it yields a calculated cohesive energy and equilibrium 
atomic radius for sodium which are within a few per cent of the experi¬ 
ment^ values. A more correct procedure, which we will go on to describe, 
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is to calculate the total energy using a wave function consisting of a single 
determinant of one-electron functions and then to make a correction for 
the effect of Coulomb correlations. This is essentially what Wigner and 
Seitz did in their second paper on metallic sodium [1934]. 

The energy of a monovalent metal containing N atoms has been found, 
using a single determinantal wave function, in § 6.1.3, and is given in equa¬ 
tion (6.18). In that expression the one-electron functions are denoted by 
if/,, where i ranges from 1 to N. Translating this into the ‘k notation’, and 
remembering that at the absolute zero of temperature there are two electrons 
with opposite spins in each occupied orbital state i J/ k , we obtain 


NE 0 = 


2 Z '!'*(*) 


k J 


h 2 N 

2 wi a 


M r ) dr 


+ 2 


Z Z f f— l^*( r i )| 2 W k( r 2)\ 2 dr j dr 2 

k k’ J J r 12 


- Z Z f f— *A*( r i) ftk'( r 2) 2) 4 / k'( r 1) di*! dr 2 

k k- J J r 12 


N N 2 

a±b R ab 


(9.57) 


where we have denoted by E 0 the average energy per electron in this ap¬ 
proximation. Here and in the following work, if there is no indication 
to the contrary, the integrals are throughout the volume of the metal. 
The functions \f/ k are assumed to be eigenfunctions of the equation 

0 (9.58) 

n 


[that is, equation (9.13)], and we will use this equation in order to simplify 
the terms of (9.57). First, it will be useful to define the potential function 
V(r) more precisely. It will be recalled that this is the potential energy of 
an electron due to the ion-core in the cell where the electron is located only. 
Also, in Chapter 6 we defined V 0 (r) as the potential energy of an electron 
in the field of the ion-core whose nucleus has position vector R a , and we 
will refer to the cell containing this ion-core as ‘cell ( a )’. We may then write 

V{r) = Z U a (r), 

a 


(9.59) 
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where 

UJ[r) = V a (r) within cell (a), ( 9>60 ) 

= 0 outside cell (a). 

F 0 (r) is, of course, spherically symmetrical about the lattice point R a , 
and may be written V(\r—R a \). 

We will denote by p{r) the number density of electrons with both kinds 
of spin at position r, that is, 

P(r) = 2 I IMr)l 2 , (9-61) 

k 


assuming that the functions are normalized to unity in the volume of 
the metal. 

Now, using equation (9.58) and the above definitions, the first term in 
expression (9.57) for NE 0 becomes 



2m 


N 


v j + I yjir) 

a 


'l'k(r)dr 


= 2 

= 2 

= 2 


e(k)-V(r)+ Y K(r) 


ij 

Y_s(k)+j f,(r) 

Z e ( fc ) + Z f p( r )l v «(r)- U a (ry]d t 

k a J 


\p k (r)dt 


Z VJ(r)-V(r) 


= 2 Z e(fc) + z f p(f) { v a(r) ~ U a (r)} di 

k a LJ (o) 


+ Z p( r ){K(r)-U a (r)}dr 

b*aJ(b) 


where in the final line the two integrals are throughout cell (a) and cell ( b ), 
respectively. However, from the definition of U a (r), we see that 
{V a ( r )~ U a {r)} is zero within cell (a) and U a (r) is zero within any other 
cell (i b ). Hence, the above expression becomes 


2 Z e ( fc ) + Z Z f p( r ) K(r)dr. 

k a^b J (b) 

We will now make the following additional assumptions: 


(9.62) 
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(i) At all points outside cell (a) we assume that 



In other words, we assume that the potential energy V a (r) is hydrogenic 
outside cell (a). This is a very good approximation, since the effective radius 
of the ion-core is small compared with the atomic radius. Indeed, this ap¬ 
proximation is valid well into the atomic cell — a fact which we shall see 
to be the basis of the Quantum Defect Method. 

(ii) We assume that the valence electron charge distribution within any 
cell (a) is spherically symmetrical, so that, as far as points outside cell ( a ) 
are concerned, the charge can be regarded as concentrated at the centre 
of the cell. This is simply the atomic sphere approximation. 

Using these assumptions, the potential energy of the interaction between 
the ion-core in cell ( a ) and the valence electron charge distribution in cell 
( b ) is found to be 



(9.64) 


“(b) (b) ^al ^ab 

The expression (9.62) for the first term in NE 0 thus becomes 



(9.65) 


The second term on the right hand side of equation (9.57), the Coulomb 
term, becomes 



a b J (a)J (b) r l2 




N N 2 AT p I* p 2 

II— +- —p(rMri) dr,dr 2 . (9.66) 


a*h R ab 2 J ( a )J ( a ) V 12 
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The last two terms in equation (9.57) cannot be reduced. Adding (9.65) 
and (9.66) to these, we find 

NE 0 = 2£e(fc)+-f f — p{r 1 )p(r 2 ) dr l dr 2 
k 2 J(a)J(a) r 12 

- I E IT— ir 2- (9.67) 

k k' *) V 2 2, 

It remains to evaluate the terms in this expression, which can only be 
done approximately for a real metal. If we use the formula (9.53) for e(fc), 
which is a very good approximation for the alkali metals, we obtain 

2£e(fc) = N(e 0 +aE F ), (9.68) 

k 

where E F is again the free-electron Fermi energy, given in equation (9.55). 

We have seen that the wave functions of the occupied states in sodium, 
and, to a lesser extent, in the other alkali metals, are very nearly free- 
electron functions. Let us therefore use the free-electron approximation 
to evaluate the remaining two terms of (9.67). Since these terms have op¬ 
posite signs, it is probable that the errors introduced in this way will partially 
cancel each other. Now, 

if f ~ p(r 1 )p(r 2 )dr idr 2 (9.69) 

d(a)d( a ) V 12 

is just the self-potential energy of a charge distribution of density ep(r) 
within an atomic polyhedron. In the free-electron approximation, 


so that (9.69) becomes 


N 3 

PO) = 1 = —3 7 
v 4 nr s 

(9.70) 

9e 2 f f dr 1 dr 2 
2(47rr s 3 ) 2 J (a) J (a) r 12 

(9.71) 


If we use the atomic sphere approximation the integral can be evaluated 
immediately by the method described in Appendix 4. When the region of 
integration is a sphere of radius r s , we have 



dr { d r 2 
*12 


32n 2 rdr r 2 dr 


32* 2 £ . 


( 9 . 72 ) 
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The Coulomb term in (9.67) is, therefore, 

9 e^N 32te 2 5 _ 3 e^N_ 
2(4nr 3 s ) 2 ~15~ ^ ~~ 5r s 

which, in atomic units, becomes 

1.2 \T A 
— N ryd. 


(9.73) 


(9.74) 


The last term in (9.67), the exchange energy, has already been evaluated 
in the free-electron approximation: it is NE X [see equation (7.53)], which 
is, in atomic units, 


0.916 

r s 


N ryd. 


(9.75) 


Adding (9.68), (9.74), and (9.75), and dividing right through by N, we 
finally obtain the average energy E 0 per electron, as a function of r s , in 
the form 


, , , , . 1.2 0.916 

Eo( r s) = £o( r s)+ a£ F( r s) + —-ryd 

r s 

, . 2.21 0.284 

= £ 0 (r s ) + a —+- r yd» 

r s r s 


(9.76) 


where r s is measured in Bohr units. Apart from the use of the free-electron 
approximation in the Coulomb and exchange terms, the only real approxima¬ 
tion has been the replacement of the atomic polyhedron by a sphere. 
Wigner and Seitz have investigated the latter approximation in detail and 
find that its effects are completely negligible. Of course, E 0 is just the average 
energy given by a total wave function consisting of a single determinant 
of one-electron functions. It is a very good approximation to the Hartree- 
Fock average energy, but yields a cohesive energy which is much too small. 
The calculated value of the cohesive energy of sodium, obtained by sub¬ 
tracting the minimum value of E 0 from —I, is only about 7 kcal/mole, 
compared with the experimental value of 26 kcal/mole. This is under¬ 
standable, for we have already seen that the first two terms of E 0 alone 
(the value of a is so close to unity for sodium that it is immaterial whether 
a is included or not) yield a good value for the cohesive energy. It should 
be noted, in passing, that if we used a total wave function consisting of a 
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single product of one-electron functions, that is, if we used the Hartree 
method, the only difference would be the omission of the exchange energy 
from E 0 . Since the exchange energy is large and negative, E 0 would then 
be very much too high and there would theoretically be no cohesion at all. 
We have previously mentioned, in Chapter 7, that the principal advantage 
of the Hartree-Fock method is that it is an improvement on the Hartree 
method in cohesive energy calculations, but it is clearly not a sufficient 
improvement. 

What E 0 still lacks is an energy term representing the effects of proper 
Coulomb correlations, which are neglected in the Hartree-Fock theory. 
We have seen that this theory accounts in some way for correlation between 
the positions of two electrons with parallel spins, but not at all for cor¬ 
relation between the positions of two electrons with antiparallel spins. 
The missing energy term, which is called the correlation energy, is therefore 
due principally to antiparallel spin correlations. It is not entirely due to 
these, however, for the parallel spin correlations which have been included 
are not proper Coulomb correlations, due to the Coulomb interaction of 
the electrons, but are more or less ‘accidental’ correlations due to the Pauli 
principle. In the following chapter we will describe one method of cal¬ 
culating the correlation energy, in which the Coulomb interaction is taken 
into account before the Pauli principle is applied, and it will be seen that 
the effect of the Pauli principle is then very much smaller. Meanwhile, we 
will devote a section to a general discussion of this energy term, before 
completing our account of the cohesive energy. 

9.5. The Correlation Energy 

The correlation energy may be defined, without any reference to electron 
spin, as follows: 

the correlation energy is the total energy, calculated with proper allowance 
for Coulomb correlations, minus the Hartree-Fock energy. 

We will denote the average correlation energy per electron by W. 

The correlation energy is far more difficult to calculate than any of the 
other energy terms. Unlike the exchange energy, it is extremely difficult 
to calculate even for free electrons. However, for a free-electron gas (and 
hence, to a good approximation, for the valence electrons in the alkali 
metals) several more or less successful attempts have been made, and there 
is sufficient agreement among the various results to make us feel reasonably 
confident about the magnitude of the correlation energy. Before discussing 
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these results, let us see what information we can get without any involved 
calculation. 

We have previously remarked that a very good value of the cohesive 
energy of sodium is obtained by taking the average energy per electron to 
be the first two terms, eo + aEp, of E 0 only. If our final cohesive energy is 
to be right, therefore, it is clear that the correlation energy must roughly 
cancel the Coulomb and exchange terms in E 0 . That is to say, we must 
have, approximately, 


W = — 


0.284 
-ryd. 


(9.77) 


This might be taken as a semi-empirical formula for the correlation energy 
of a free-electron gas for electron densities such as occur in metals. Detailed 
calculation shows that this formula is not far wrong, but it is, of course, 
of little help to us, for in obtaining it we have made an assumption about 
the cohesive energy, which it is our object to calculate. None the less, it 
gives us some idea of what to expect. We note, in particular, that the 
correlation energy is of roughly the same magnitude as the cohesive energy, 
so that it is far from being negligible. 

A quantity which can be calculated without difficulty is the correlation 
energy of a free-electron gas at very low densities , that is, for very large 
values of r s . We know that the kinetic energy is inversely proportional 
to r s 2 , whereas the potential energy of the Coulomb interaction is inversely 
proportional to r % . As r s increases, therefore, the kinetic energy will 
eventually become negligible compared with the potential energy, and the 
electrons will tend to form a stable lattice. When this occurs the total 
energy may be calculated quite simply using the atomic sphere approxima¬ 
tion. Remembering that in the free-electron model the electrons move in a 
uniform distribution of positive charge just sufficient to neutralize the 
electronic charge, we see that the energy per electron at very low densities 
is approximately that of an electron at the centre of a uniform spherical 
distribution of positive charge, of radius r s and total charge e. 

The density of the positive charge is 3e/47tr s 3 . If an electron is at the 
centre of a sphere, of radius r, of positive charge of this density, the total 
charge within the sphere is 


— e+ 


4tc 3 

— r 
3 
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and the potential at the surface is 



The energy required to bring a shell of positive charge, of thickness dr, 
from infinite distance is, therefore, 



e 3e 
r Anri 


Anr 2 dr 



The energy per electron at very low densities is, thus, 



(9.78) 


or, in atomic units, 



(9.79) 


Now, the energy per electron in the Hartree-Fock approximation [see 
equation (7.54)], if the kinetic energy is again ignored, is just the exchange 
energy, 

0.916 

-ryd. 


The average correlation energy per electron at very low densities is, there¬ 
fore 


1.8 0.916 0.88 

W =-+ ---ryd 


(9.80) 


(the approximations used do not warrant the retention of more than two 
figures in the numerator). 

This is not a semi-empirical result, but a. straight-forward theoretical 
one which is correct for a free-electron gas at very low densities. It must 
be remembered, however, that the valence electrons in a metal only ap¬ 
proximate to a free-electron gas at the densities normally occurring in 
solid metals (r s lies between 3 and 6 B.u. for the alkali metals). When 
the interatomic spacing is very large, that is, when we have virtually a 
system of free atoms, the valence electron wave functions are very dif¬ 
ferent from free-electron functions. The above formula therefore has no 
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direct application. None the less, it is extremely useful in correlation energy 
calculations, for a reason which will become apparent. 

A possible method of calculating the correlation energy which im¬ 
mediately suggests itself is by perturbation theory. The essential terms 
in the Hamiltonian of a free-electron gas are 



(9.81) 


What has been omitted is a term representing the interaction of the electrons 
with the neutralizing uniform distribution of positive charge and the self¬ 
energy of this charge. If the second term were not present the eigenfunctions 
of (9.81) would be merely determinants of free-electron functions. It might 
therefore seem a simple matter to account for the second term, which 
includes the effects of all correlations, by perturbation theory. This has 
indeed been done, to various degrees of accuracy, but the task is by no 
means an easy one. This is because, although the first-order correction, 
which consists of the exchange energy and the Coulomb energy [the latter 
is cancelled by the two terms omitted from (9.81)], is obtained without 
difficulty, the second-order correction is found to diverge. A convergent 
result can only be obtained by including higher-order perturbation correc¬ 
tions, and, in fact, summing the whole perturbation series — this should, 
of course, give the energy with complete accuracy (see § 3.8), although 
certain approximations are required in practice. 

The straightforward perturbation calculations are only valid at high 
densities (r s < 1 B.u.), so that in order to obtain from them the correlation 
energy at actual metallic densities it is necessary to interpolate between the 
high density and low density limits — hence the utility of formula (9.80). 
The earliest interpolation formula of this sort was obtained by Wigner 
[1934, 1938], which gave t 



(9.82) 


For sodium, with r s = 4 B.u., this gives W = -0.075 ryd, which is very 


t The formula originally given by Wigner was W = — 0.576/(r 8 +5.1) ryd, but this, 
as Wigner [1938] himself pointed out, was based upon an incorrect low density limit. 
Formula (9.82) was given by Pines [1955] — it agrees with Wigner’s result in the limit 
as r e -* 0, and with formula (9.80) for very large r B . Since Wigner claimed an accuracy 
of only 20 per cent for his formula, the correction resulting from the use of (9.82) is 
hardly significant. 
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close to the value -0.071 ryd, given by the semi-empirical formula (9.77). 
Wigner’s formula has had so much success in cohesive energy calculations 
that it seems unlikely to be greatly in error, and more recent work bears 
this out. Gell-Mann and Brueckner [1957], for example, have recently 
made a calculation of the correlation energy of a free-electron gas which 
they claim to be perfectly accurate in the high density limit. This gives 

W = 0.0622 log r s -0.096 + 0(r s ) ryd, (9.83) 

the last term being very small and tending to zero with r s . This formula is 
certainly wrong for low densities and for the densities occurring in actual 
metals, but for r s = 1 B.u., ignoring the term 0(r s ), it gives W = -0.096 
ryd, whereas formula (9.82) gives W = —0.100 ryd. 

There have been other calculations of the correlation energy of a free- 
electron gas, based upon perturbation theory of infinite order, which we 
will not enumerate. In the following chapter, however, we will describe a 
method, due to Pines, which involves only second-order perturbation 
theory, because it uses the concept of plasma oscillations to reduce the 
effective range of the Coulomb interaction. The formula obtained by Pines, 
which is believed to be valid in the region of actual metallic densities, 
although not for very high or for very low densities, is 

W = 0.031 log r s —0.115 ryd. (9.84) 

This gives values very close to those of formula (9.82) for r s between 2 
and 6 B.u. — for sodium, in particular, it gives W = —0.072 ryd. 

9.6, Further Remarks on the Cohesive Energy 

The final expression for the average energy per electron of an alkali 
metal, which we will denote by E, is thus 

E(r s ) = e 0 (r s ) + a ^ + — + W(r,) ryd, (9.85) 

r r 

' s ' s 

where W(r s ) is given by either (9.82) or (9.84). If we again denote by r 0 
the atomic radius corresponding to the minimum value of E, then the 
cohesive energy per electron is 

5= —I—E(r 0 ) ryd, (9.86) 

where I is the ionization energy of a free atom. The calculated values of 
the cohesive energies and equilibrium atomic radii of the alkali metals, 
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obtained in this way, are within a few per cent of the experiments, values. 

Other quantities which can be directly calculated from equation (9.85) 
are the pressure p and compressibility k, at the absolute zero of temperature, 
for any value of r s . If Q is the volume of an atomic sphere, the pressure is 
given by 


P = 


d E 
d Q ’ 


(9.87) 


and the compressibility by 


Thus, 


or, since Q = jicr s 3 , 


1 

Q dp 


1 

K 


= = 

d £ 

£2 , 


dQ 2 

1 d 2 £ 

1 dE 

127ir s dr 2 

6nrf dr s 


(9.88) 


(9.89) 


(9.90) 


When r s = r 0 , the equilibrium atomic radius at zero temperature and 
pressure, we have dE/dr s = 0, and 



The compressibilities of the alkali metals calculated by means of this 
formula again agree very well with the experimental values. 

Equation (9.85) does not apply directly to the other monovalent metals, 
copper, silver, and gold, even though the valence electron wave functions 
in these metals approximate almost as closely to free-electron functions as 
they do in the alkali metals. This is because the ion-cores (and, in particular, 
the outermost d sub-shells) in copper, silver, and gold are so large that 
neighbouring cores overlap appreciably. It is therefore not permissible 
to treat the interaction of the ion-cores as if they were point charges,.as we 
did in obtaining equation (9.85). Strictly speaking, the outermost d 
electrons in these metals should be treated in the same way as the valence 
electrons (see § 8.12), but, as the d electrons are by no means free, such a 
calculation is hardly possible at the present time. Some success has been 
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obtained by simply adding on to E, as given in equation (9.85), a term 
representing the additional energy due to the overlapping of the ion-cores, 
which has been estimated using the Thomas-Fermi approximation; but we 
will not go into this. 

Equation (9.85) applies only to monovalent metals (with small ion-cores), 
but it can easily be generalized to metals of any valency Z. If e 0 (r s ) is again 
the eigenvalue of equation (9.10), subject to the same boundary condition 
(9.8), with V(r) now the potential function of the Z-times ionized atom t, 
wc obtain, for the average energy per electron, 



r, 


r. 


* S 


The kinetic, exchange, and correlation energies are obtained, in the free- 
electron approximation, from those of a monovalent metal simply by 
substituting Z~*r s for r s . The Coulomb energy per atom is still given by 
expression (9.69), but now, in the free-electron approximation, 



(9.93) 


so that the total Coulomb energy is given by (9.74) multiplied by Z 2 . 
This leads to the expression 1.2Z/r s for the Coulomb energy per electron. 

We should, of course, expect equation (9.92) to be a very much worse 
approximation for a polyvalent metal, whatever the size of the ion-core, 
than it is for an alkali metal. This is because the electrons in a polyvalent 
metal occupy states in more than one Brillouin zone, and equation (9.53) 
for e(k) is only expected to hold near the centre of the first zone. Nor is 
it likely that the free-electron approximation will apply so well to the 
electronic interaction terms. None the less, remarkably good results have 
been obtained by this method (see Brooks [1958]) for polyvalent metals 
with small ion-cores, such as magnesium and aluminium. This suggests that, 
for such metals, in spite of the effects of the zone boundaries, the valence 
electrons are, in some respects, still very nearly free. 


* The use of this potential function cannot be justified by the effects of Coulomb 
correlations, since it is clear that in the case of a polyvalent metal there must be on the 
average Z electrons in an atomic polyhedron. The justification is the fact, mentioned 
previously, that the wave function y> 0 has been found to be virtually the same whether 
or not the self-consistent field of the electrons is included in V(r). 
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9.7. The Quantum Defect Method 

The purpose of the Quantum Defect Method is to obtain the energy 
levels of a valence electron in a metal directly from the observed energy 
levels in a free atom, without the necessity of constructing an ion-core 
field V(r). In order to give the bases of the method in as simple a form as 
possible, we will confine our discussion to the state \}/ 0 of lowest energy 
in the metal, although the method has been applied to other states. 

We have remarked in § 9.4 that, outside the effective radius of the ion- 
core, which is small compared with the atomic radius, the potential function 
V(r) of a monovalent metal ion is very nearly hydrogenic — that is to say, 
for practical purposes, 

V(r) = —e 2 /r. (9.94) 

This means that, outside the ion-core, and hence in the region where it is 
required to fit the boundary condition (9.8), equation (9.10) is just the 
hydrogen atom equation 



or, in atomic units, 

^ + g ^ + (e„+ -) = 0. (9.96) 

dr r dr \ r/ 

The solutions of this equation are known in analytical form, so that it would 
seem an easy matter to fit the boundary condition (9.8). The difficulty is, 
however, that for each value of e 0 there are two independent solutions, 
the general solution being ah arbitrary linear combination of the two. 
The problem is thus to obtain the correct linear combination, which will 
join smoothly to the wave function inside the core for the same value of 
e 0 . It has been found possible to do this without knowing anything about 
the wave function inside the core, by making use of the experimentally 
determined quantum defect. 

The energy levels of a hydrogen atom are given, in atomic units, by the 
formula 

e — —\ ryd, (9.97) 

n 

where n is an integer called the principal quantum number. It is found that 





278 


THE METHOD OF WIGNER AND SEITZ 


[Ch. 9, § 7 


the energy levels of the s states of the valence electron in an alkali atom 
can be expressed by the formula 



(9.98) 


where n is again the principal quantum number, and <5, which varies only 
slightly with n, is called the quantum defect (the same formula applies to p 
states, d states, and so on, with different values of the quantum defect, 
but in order to find fi/ 0 we need only consider s states). The quantum defect 
is a smooth and slowly varying function of e, which may be expressed very 
accurately by a polynomial of low degree, such as 


<5 = a + bs + cs 2 


(9.99) 


(for sodium, the values of the coefficients are, roughly, a = 1.35, b = —0.06, 
c = 0.01). The experimental values of e form a discrete set, of course, but 
having obtained equation (9.99) we may use it to define the quantum 
defect for any value of e. The energies s 0 in which we are interested lie lower 
than the energy of the normal state of a valence electron in a free atom, 
which is the lowest energy for which we have an experimental value of <5, 
so that it is necessary to use equation (9.99) as an extrapolation formula. 

Now, the general solution of equation (9.96), for a given s 0 , may be 
written 


Mr) = AF(r) + BG(r), 


(9.100) 


where A and B are arbitrary constants and F{r) and G{r ) are known, in¬ 
dependent solutions. These solutions may be chosen in any number of 
ways, but we assume here that two particular functions have been chosen, 
which we will not specify in detail, except to say that F(r) is finite at the 
origin while G(r ) becomes infinite there. We know that, when e 0 is one 
of the free-atom energy levels, the correct linear combination of F(r) and 
G{r) must tend to zero as r tends to infinity. The function i^ 0 (r) given by 
equation (9.100) then represents the free-atom wave function outside the 
ion-core, for this particular s 0 , and so must necessarily join smoothly 
to the (unknown) wave function inside the core. The correct ratio B/A 
in this case is found, by analysis which is much too complicated for in¬ 
clusion here, to be 


B 

— — — tan nd. 
A 


(9.101) 
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It is then assumed that, for all the values of e 0 in which we are interested, 
the ratio BjA for the function ^ 0 {r) outside the core, which joins smoothly 
to the true wave function inside the core, is given by the same formula, 
when <5 is the extrapolated quantum defect defined by equation (9.99), 
with e = e 0 . The value of r s , lying outside the ion-core, for which the 
derivative of t p 0 (r) vanishes, can then be found directly from (9.100). In 
this way, e 0 is obtained as a function of r s without ever solving the Schro- 
dinger equation inside the ion-core. Although the values of the cohesive 
energies and equilibrium atomic radii of the alkali metals given by this 
method are in remarkably good agreement with experiment, the assumptions 
involved require further justification. For such justification, and a more 
detailed description of the method, interested readers are referred to the 
articles by Ham [1955] and Brooks [1958]. 

The quantum defects of the p states of a valence electron in a free atom 
have also been used, with a modification of the foregoing method, to solve 
equation (9.35) for PO) outside the ion-core. This permits the quantity 
a to be found from equation (9.51), which only requires a knowledge of 
P(r) in the vicinity of the surface of the atomic sphere. There appears to be 
one difficulty here, however, for the calculation of the factor y requires 
that u 0 (r) be normalized, and this would seem to require a knowledge of 
u 0 (r) throughout the whole atomic sphere. It is an interesting fact that 
this is not so, provided « 0 (r) is known, outside the ion-core, for a range of 
values of £ 0 . At a given value of r, it is clear that the value of the function 
u 0 (r) will vary with e 0 , and hence it may be differentiated with respect to 
e 0 . Silverman [1952] has proved that, with/(r) = ru 0 (r) as before, 





(9.102) 


This integral determines the normalization of u 0 (r), and we see that its 
evaluation only requires a knowledge of how the function f(r) varies with 
£ 0 and with r outside the ion-core. 
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CHAPTER 10 


PLASMA OSCILLATIONS IN METALS 


10.1. Introduction: Qualitative Description of Plasma Oscillations 

In the previous chapter we showed how important it is to take proper 
account of Coulomb correlations in calculating the cohesive energy of a 
metal, but we also noted how difficult it is to do this, even in the case of 
free electrons. The principal difficulty is due to the fact that the Coulomb 
interaction of the electrons is so strong and of such long range that its effects 
cannot be calculated by ordinary perturbation theory. On the other hand, 
we saw in Chapter 7 that, as far as properties which depend only upon the 
density of states in energy are concerned, very good results can be obtained 
by treating the electrons as non-interacting particles — or, in the Hartree 
theory, by considering only the average effect of the electronic interaction. 
This independent particle approximation is the basis of nearly all band 
structure calculations — in our account of Brillouin zone theory it is im¬ 
plicit in the assumption [see equation (8.71)] of the same potential function 
V(r) for all one-electron states. 

During recent years a theory of the interaction of electrons in metals 
has been developed which offers a simple justification of the independent 
particle approximation, and, at the same time, permits the correlation 
energy to be calculated using ordinary second-order perturbation theory. 
This is the theory of plasma oscillations in metals, due to Bohm and Pines 
[1951, 1952, 1953], Pines [1953], Although the qualitative ideas of the 
theory are now widely accepted, the detailed quantum-mechanical develop¬ 
ment is still in some dispute, so that the subject-matter of the present chapter 
is rather more speculative than what has gone before. None the less, the 
basic principles of the theory are so simple, and its results are of such 
practical importance, that it has been thought worth while to include a 
brief account of this theory in the present work. A more complete account, 
which is still not too difficult, may be found in an article by the author 
(Raimes [1957]). 

Any highly ionized gas, that is, a system composed of a large number 
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of positive ions and virtually free electrons, with zero total charge, is called 
a plasma. Such plasmas, which occur in gas discharge tubes, have been 
studied both theoretically and experimentally for many years, and interest 
in them has lately intensified owing to their use in thermonuclear devices. 
However, it is clear that a metal may also be regarded as a plasma, for we 
have seen that a metal is a system of positive ions arranged on a crystal 
lattice, together with valence or conduction electrons which are more or 
less free to travel throughout the lattice. We know that the ions in a metal 
have a slight vibrational motion, but this may be neglected for most purposes 
and the ions regarded as static. Although it is formally possible to include 
the ion-core field of a metal in the quantum mechanical theory of plasma 
oscillations, accurate numerical results have so far been obtained only in 
the case of a free-electron gas, and we will restrict ourselves to this — 
we have seen in the previous chapter that, as far as the electronic inter¬ 
action terms are concerned, the free-electron approximation may be expected 
to apply reasonably well to a number of metals. In this approximation the 
system of ion-cores is replaced by a uniform distribution of positive charge, 
which we will refer to as the background of positive charge, whose density 
is equal to the average electronic charge density, but of opposite sign. 

The charge density of the positive background and electrons together is 
on the average zero, but the electrons are always in motion (we refer to 
this as the thermal motion, although it occurs even at the absolute zero of 
temperature), so that the electron density is, in fact, continually varying 
throughout the plasma. Now suppose that, as a result of a chance fluctuation 
in the thermal motion, the electronic charge density in some region is 
reduced below the average. The positive background is no longer neutralized 
in that region and the resulting positive charge attracts neighbouring elec¬ 
trons. The tendency is to restore charge neutrality, but the attracted electrons 
acquire momentum and so overshoot the mark — that is to say, more 
electrons accumulate in the region than are necessary to neutralize the 
positive charge. The excess negative charge thus created repels electrons 
outwards again, and so oscillations are set up. These longitudinal oscillations 
of the electron gas, analogous to sound waves, are called plasma oscillations. 

The approximation of replacing the ions by a uniform distribution of 
positive charge has also been used in the case of gas discharge plasmas. 
There is, however, an important difference between gas discharge plasmas 
and metallic plasmas. In the former the density (of the order of 10 12 /cc) 
of the electrons is so small that their motion may be treated by classical 
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mechanics or Maxwell-Boltzmann statistics. The motion of the high density 
(of the order of 10 23 /cc) electron gas in a metal, however, must properly 
be treated by quantum mechanics or Fermi-Dirac statistics. None the less, 
a useful qualitative picture of plasma oscillations in metals, and, as we 
shall see, even reasonable quantitative results, can be obtained by classical 
methods. 

10.2. Elementary Classical Theory: The Plasma Frequency 

Let us denote by n the average density of electrons, so that the average 
electronic charge density is —en, and the uniform charge density of the 
positive background is en. In order to obtain the frequency of plasma 
oscillation in as simple a way as possible, let us at first neglect the thermal 
motion of the electrons. We assume that the electrons are initially at rest and 
distributed with uniform density n, and are then displaced radially outwards 
from some point in the plasma, which we take to be the origin O. The 
displacement £(r) is a function of r, the distance from O, and we will suppose 
this function and its gradient to be everywhere small. As a result of the 
displacement the number of electrons which leave a sphere of radius r, 
centred at O, is very nearly 4nnr 2 £(r) — the number originally contained 
within a shell of thickness £(/•) at the surface of the sphere. Owing to the 
positive background, therefore, the sphere is left with the charge 4nenr 2 £(r). 
The electric field at distance r from O is consequently of magnitude 4nen^{r) 
and directed radially outwards, so that each electron experiences a force 
— 4ne z n£(r) radially outwards. This force must be equated to m|(r), where 
m is as usual the mass of an electron and i(r) is its outward acceleration, 
so that 



( 10 . 1 ) 


This is the equation of simple harmonic motion, and tells us that the 
electrons will oscillate about their equilibrium positions with angular 
frequency 



( 10 . 2 ) 


which is called the plasma frequency. For a typical gas discharge plasma 
co p is of the order of 10 10 radians/sec, whereas for a metal it is of the order 
of 10 16 radians/sec. 
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Of course, we have ignored the thermal motion of the electrons in the 
foregoing work, except in so far as thermal fluctuations cause a departure 
from uniform electronic density and hence initiate plasma oscillation, 
but the result remains plausible if we regard the displacement as being 
a ‘drift’ displacement superimposed upon the random thermal motion. 
When proper allowance is made for the thermal motion it is found that 
the angular frequency of oscillation co is very nearly ca p for long waves, 
but increases as the wavelength X decreases. The dispersion relation obtained 
by classical methods is, approximately, 

co 2 = cup+/c 2 <i;f>, (10.3) 

where k(= 2n/X) is the wave number of the plasma wave and <u?> is the 
average of the squared velocities of the electrons. The dispersion relation 
given by the quantum-mechanical theory is the same as this as far as terms 
of the second order in k. 

In fact, the dispersion is not very great, a> never differing from <o p by 
more than a few per cent, because there is an upper limit to k, which we 
will denote by k c , for wave numbers above which plasma oscillations do 
not occur. It will be seen in the following section that the effect of the 
thermal motion is to disrupt the plasma oscillations and to prevent their 
occurrence with short wavelengths, so that there is a lower limit to the 
wavelength of plasma oscillation. That this must be so also follows simply 
from the corpuscular nature of the electron gas. It is meaningless to speak 
of organized oscillations of the latter with wavelengths smaller than the 
average interelectronic distance, which is a few angstrom units in metals. 
We therefore expect k c 1 to be of the order of lA, and this is borne out by 
calculation which takes into account the effect of the thermal motion. 

10.3. A More Detailed Classical Theory 

It is not our intention in this book to give a complete description of the 
quantum-mechanical theory of plasma oscillations, although we shall use 
the results of this theory and attempt to make them plausible. It will be 
Jielpful in the latter task if we first present a more detailed classical theory 
than that given above. This introduces several concepts which are used 
in the quantum-mechanical theory, and also clearly demonstrates the 
disruptive effect of the thermal motion on the plasma oscillations. 

It is convenient for purposes of computation to assume that the metal 
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is in the form of a cube of unit volume. In our free-electron model, then, 
we have n electrons in a cubic box of unit volume, which also contains 
a uniform distribution of positive charge of density en. It is our object to 
show that the Fourier coefficients of the electron density oscillate approxima¬ 
tely with angular frequency <u p , provided the wavelength is sufficiently 
long. To do this we must first of all expand the potential energy 6f the 
electrons in Fourier series within the box. 

We write the potential energy due to the Coulomb interaction between 
the ith and y'th electrons as follows: 


- = Ic.e"" 1 "-"*, 


(10.4) 


where 


k = 2n(n 1 e 1 +n 2 e 2 + n 3 e 3 ), 


e x , e 2 , e 3 being unit vectors along three mutually orthogonal edges of the 
cube, and n x , n 2 , n 3 being integers — the sum is thus a triple sum over all 
the positive and negative integral values of n 1 , n 2 , n 3 , including zero. 
To determine the coefficients c k we multiply both sides of equation (10.4) 
by exp [i k' • (r y — r f )] and integrate with respect to r i over the volume of 
the box; thus 

k'-(rj-ri) f 

-d rj = X cj e ,(fc - fc ' ) - (r,-r ' ) di> (10.5) 

r;; k J 



The integral on the left-hand side is similar to that appearing in the ex¬ 
pression (7.36), and may be evaluated in the same way. We find 


I 


*ifc' • O j-n) 


, 471 

dr: = — 
J k’ 2 


( 10 . 6 ) 


The integral on the right-hand side of (10.5) is elementary, and has the 
value zero, unless k = k’, when it has the value unity, since the box is of 
unit volume. We have, finally, 


c k — 



(10.7) 


except when k = 0. Integration of equation (10.4) gives immediately 



( 10 . 8 ) 
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which is just the potential energy of the zth electron due to a uniform 
distribution of one electronic charge throughout the box. We may thus 
write the total potential energy of the zth electron, due to its interaction 
with all the other electrons and with the positive background, as 


U(r,) = I r 

j±i k 


4ne 2 


\k • ( r t -rj) 

C 5 


(10.9) 


the prime indicating that the terms with k = 0 are excluded, these terms 
being cancelled by the effect of the positive background. 

The total force + exerted on the zth electron is 


- grad 


i U(r,) = - («, 


d d 

- V&2 —~ 

dx t dy, 



and this may be equated to mv h where v t = r { is the velocity of the electron. 
We find 


»! 


4ne 2 i 

m 


H'pe 
J k k 


i k 


( n-rj) 

5 


( 10 . 10 ) 


in which it is unnecessary to omit explicitly the term with j = z, since 
this is automatically zero owing to the symmetrical distribution of the k 
vectors. 

Now let us consider the actual distribution of electronic charge at a given 
instant, by which we mean the discontinuous distribution represented by 
the instantaneous positions of all the electrons, and not a local volume 
average, which is usually implied. Regarding the electrons as point particles, 
the actual electronic charge density may be written — ep{r), where 

p(r) = Y J K r ~ r i)> (10.11) 

i 

S(r — ri ) being the Dirac (i-function 1 " 1 ' and the sum being over all the electrons. 


T It should be noted that here, as in the calculation of the cohesive energy, we are 
neglecting magnetic forces, which are of a very much smaller order of magnitude than 
the electrostatic forces. 

tt This is the three-dimensipnal form of the function defined on p. 214. It may be 
defined similarly by the relations <5(r— r t ) = 0, for r^r iy and J(5(r— r { )dr = 1, the in¬ 
tegration being throughout any region containing the point r,. The principal property 
of this function is expressed in the equation 

|/(r)<5(r-r,.)dr =/(r,), 

where /(/*) is an arbitrary function and the integration is again throughout any region 
containing /%. 
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The function p(r), which we will call the particle density [the number of 
electrons in any region is obtained by integrating p(r ) throughout the 
region], may also be expanded formally in Fourier series within the box. 
Thus, we write 

p(0 = ZPk eifcr ’ (10.12) 

k 

where 

p t =jp(r)c -<--dr = Ie-“ (10.13) 


p 0 , the Fourier coefficient with k = 0, is just the average electron density 
n, and the other coefficients represent fluctuations about this average 
density. In terms of these density fluctuations equation (10.10) becomes 


. Anc i k ik • n 

v i = L ,2^ e • 

m k k 

(10.14) 

Now, from equation (10.13) we have 


A - -il(* 

i 

(10.15) 

and 


h= -Z[(fe-^) 2 +ife-« i ]e- ifc - ri . 

i 

(10.16) 

Substitution of v { from equation (10.14) gives 


Ph = -E(kv,) 2 c '■ -II p„.t 

i TYl i k' k 

i(k'-k) • r t (10.17) 

The last term of this equation may be split into two parts, consisting of 
those terms of the double summation with k' = k and those with k # k 

respectively, as follows: 


4ne 2 n 4 ne 2 k ■ k’ 

Pk 2 j , , 2 Pk'Pk-k'- 

m m k'*k k 

(10.18) 


We expect the Fourier coefficients p k , for k + 0, to be very small compared 
with n, since there is a large number of electrons and they are more or less 
randomly situated. The phase factors exp( — ik • r { ) are complex numbers 
of unit modulus, and p k is thus the sum of a large number of unit vectors 
with random directions in the complex plane — for each vector with a 
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given direction there is one with very nearly the opposite direction, so that 
the sum must generally be small. It is also clear that the time average of 
p k is zero. It is reasonable to suggest, therefore, that the second term in 
(10.18), which is a sum of products of two Fourier coefficients, will be 
small compared with the first, at least for small values of k. Admittedly, 
this is not entirely obvious, for the second term is an infinite sum, but 
detailed consideration (see Bohm and Pines [1952]) shows that it is indeed 
/ery small compared with the first term, for the small values of k which 
will concern us, and we will neglect it. This is called the random phase 
approximation by Bohm and Pines, and is also a central feature of their 
quantum-mechanical theory of plasma oscillations. 

Using the random phase approximation, then, and formula (10.2) for 
cjp, equation (10.17) becomes 

Pk= ~ ■ v { ) 2 e~ ik ' n -co 2 p p k . (10.19) 

i 

The first term on the right does not involve the accelerations and would 
be present even if the electrons did not interact — it arises simply from the 
thermal motion of the electrons. The second term represents the effect of 
the electronic interactions. It is clear that, for sufficiently small values of 
k, the first term, which contains the factor k 2 , may be neglected in comparison 
with the second, giving 

Pk + ^lPk = 0 . ( 10 . 20 ) 

Thus we see that, for sufficiently long wavelengths, the Fourier coefficients 
of the electron density oscillate with the plasma frequency; which is another 
way of expressing the result obtained in § 10.2. Equation (10.19) tells us 
more than this, however, for it shows explicitly the effect of the random 
thermal motion of the electrons, and enables us to fix an approximate 
upper bound to the wave number of plasma oscillation. The equation may 
be written 

Pk = ~ Z [(* • v i ) 2 + co 2 p ^~ ik ' ri , (10.21) 

i 

and we deduce that plasma oscillation may occur provided the wave number 
k is such that 

(o p > kv o, (10.22) 

where v 0 is the velocity at the top of the Fermi distribution, that is, the 
maximum velocity of any electron at the absolute zero of temperature. 
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In other words, using the notation introduced in § 10.2, we may write, 
very roughly, 

K ~ <w p /»o- (10.23) 

This gives a value of k ~ 1 for a typical metal of about an angstrom unit, 
in agreement with our previous statement. 

It must be remembered that classical methods, which are not strictly 
applicable to the electrons in a metal, have been used in the foregoing work. 
None the less, the results of the quantum-mechanical theory of plasma 
oscillations are found to be in essential agreement with those obtained 
above. Of course, (10.23) only provides a very rough upper bound to the 
wave number of plasma oscillation. In fact, owing to the presence of the 
first term on the right-hand side of equation (10.19), the density fluctuations 
p k will never quite oscillate harmonically, even when k is small. However, 
it is possible to split p k into two independent components, one of which 
represents the collective motion, that is, the plasma oscillations, and the 
other represents the random thermal motion. The first component predo¬ 
minates at long wavelengths, satisfying the condition (10.22), and, within 
the random phase approximation, is found to oscillate harmonically with 
angular frequency given approximately by the dispersion relation (10.3). 
We will not go into this in detail, as the treatment we have given is sufficient 
for our purposes — we have already remarked that, in any case, the dis¬ 
persion is small. 

10.4. Elementary Quantum Theory: The Excitation Energy of a Plasmon 

We have seen in the foregoing section that the plasma oscillations can be 
represented approximately by a finite set of harmonic oscillators with angular 
frequency co p , and these may be thought of, roughly, as the density fluc¬ 
tuations p k with k < k c . According to elementary quantum theory, as shown 
in Chapter 1, the energy of such an oscillator must have one of the values 
given by the expression (jV+x)/ico p , where A is a positive integer or zero. 
Pines has given the name plasmon to a quantized plasma oscillator, and we 
shall make use of this in .the following work. The state with N = 0, or 
ground state, has energy \ha> p , which is the zero-point energy of the plasmon. 
The energy required to raise the plasmon from one state to that of next 
higher energy, the excitation energy, is ha> p . The success of the plasma 
theory as a computational tool depends crucially upon the magnitude of 
fi(o p , which we will now calculate. 
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The atomic radius r s is defined by t 


An 3 1 


(10.24) 


Thus, equation (10.2) gives 


col = 

P 


Ane 2 n 3e 2 


(10.25) 

m mr“ 

If, now, r s is measured in Bohr units and energy in rydbergs, we obtain 

_ 2^3 


hco p = ryd. 
r? 


(10.26) 


For sodium, with r s = 4 B.u., this gives hco p = 0.433 ryd = 5.9 eV. In 
general, ior the valence electrons in all metals, hco p lies between about 3 
and 25 eV. 

It is more significant to compare hco p with Co, the maximum kinetic 
energy of any electron at 0°K, according to the Sommerfeld-Hartree theory 
of a free-electron gas. Equation (7.11) gives 


Co 


2m \ 4 / r 2 


(10.27) 


which becomes, in atomic units, 

„ 3.68 

to = —r ryd. 

r s 

Thus, from (10.26) and (10.28), 

tico p = 0.94r±e o , 


(10.28) 


(10.29) 


when r s is measured in Bohr units. 

Now 0.94 rf > 1 for all metals, so that hco p > Co- We have seen in 
Chapter 7 that, at normal temperatures T, only a few electrons in states 
near the Ferrrii surface are excited and their thermal energies are of the 
order of hT, which is very much smaller than Co- We may therefore say 


♦ As explained in § 7.2, this is strictly true only for a monovalent metal — for a metal 
of valency Z we must replace r s , in (10.24) and any formula derived from it, by Z~i r e . 
However, in work on electronic interactions it is customary to define r a by -equation 
(10.24) regardless of the valency of the metal to which application might be made (n 
always being, of course, the valence electron density). This point must be borne in mind 
when making numerical calculations using published tables of atomic radii. 
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that hco p , the excitation energy of a plasmon, is very much greater than 
the thermal energy which any electron is able to give away at normal 
temperatures. This means that thermal excitation of plasmons can usually 
be neglected and the plasmons will remain in their ground states unless 
excited by some other method, such as the passage of a fast charged particle 
through the metal (see § 10.9). 

The foregoing is the most important fact in the plasma theory. The practical 
utility of the theory depends upon it, for it means that the plasmons take 
no active part in many electronic processes and may often be ignored. 
Now, plasma oscillation is an organized, motion of a large number of 
electrons and does not occur with wavelength smaller than a few angstrom 
units. It may be inferred from this that the long-range part of the Coulomb 
interaction is responsible for the plasmons, so that in calculations where 
the latter may be ignored the former may also be ignored. We shall see in 
the following section that the remaining interaction has an effective range 
of about k ~ 1 , which is of the order of 1 A, and this is so short that reasonable 
results might often be expected by ignoring this interaction also, that is, 
by treating the electrons as non-interacting particles. The plasma theory 
thus offers a justification of the independent particle approximation, which 
has been used with much success in the theory of metals. 

10.5. The Short-Range Interaction 

The total potential energy due to the Coulomb interaction of the electrons is 



(10.30) 


According to equation (10.9), this potential energy, together with that due 
to the background of positive charge, may be written 


i i*j k K 


i k ■ ( ri-rj ) 


(10.31) 


Now, from equation (10.13), we have 


\k • ( ri-rj) 


(10.32) 


i j 


k k 


so that (10.31) becomes 


(10.33) 
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the term —n being due to the fact that i / j in (10.31). The work of § 10.3 
tells us that the density fluctuations p k , with k < k c , are described by the 
plasmons. This suggests, as is confirmed by the quantum-mechanical 
theory, that, after the plasma oscillations have been accounted for, the 
potential energy due to the residual interaction, which we denote by i/ s r , 
consists of those terms of (10.31) with k > k c ; thus 


H,, -HI 

i^j k 
(k>k c ) 


2ne 2 


Jk ■ ( r t — rj ) 


(10.34) 


In other words, comparing (10.30) and (10.34), the potential energy of 
two electrons a distance r apart is no longer e 2 /r, but 



(fc>fcc) 


e 


ifc • 


r 


(10.35) 


If we replace the sum by an integral, remembering that the density of 
representative points in k- space is I/ 871 3 , since the metal has unit volume, 
(10.35) becomes 


where 


with 


2n 2 


4 e ikr d k 

, b - 2 

(fc>fcc) K 


2 poo pTi 

= - dfc e iftrcos0 sin Odd 

71 J k c J 0 


2e 2 f 00 sin kr e 2 

- — , ik = - F(k c r), 

71 J kc kr r 

(10.36) 

KKr) - - r Sin X d.x = 1 - - Si (k c r), 
nJ kcr X n 

(10.37) 

Si 00 = f sin x dx. 

J 0 X 

(10.38) 


The function Si(y) cannot be evaluated analytically, except for y = 0, 
when it has the value zero, and in the limit as y oo, when it has the 
value n/2. However, tables of values exist, and when these are used the 
graph of the function F(k c r ) is found to be as shown in figure 10.1. The 
graph of the function exp( — k c r) is also shown for comparison. For values 
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of k c r less than 2 the functions are very similar, falling rapidly from unity 
at the origin to nearly zero. At 
larger values of kj the function 
F(k c r ) oscillates with decreasing am¬ 
plitude, but these oscillations are 
negligible when multiplied by e 2 /r 
— they have no physical significance, 
but are due to the assumption that 
the plasmons cease abruptly at 
k = k c , which rather oversimplifies 
the problem. We see, then, that the 
short-range interaction, that which 
remains when the long-range inter¬ 
action has been accounted for by 
the plasmons, is a screened Coulomb 
interaction and the effective range, 
or screening distance, is roughly k~ l , that is, one or two angstrom units. 

The total potential energy (10.34) due to the short-range interaction may 
be written 

= (lo- 39 ) 

i*j r y 

In the following section we will calculate the exchange energy using this 
expression instead of the full Coulomb interaction (10.30). 

Of course, a screening of the charge of an electron is expected simply 
as a result of the polarization of the surrounding plasma, that is to say, 
the repulsion of neighbouring electrons leaving an excess positive charge 
around the given electron. We calculated this screening by the Thomas- 
Fermi method in § 7.10.1. According to equations (7.81) and (7.83), the 
potential at distance r from a given electron is 

(10.40) 

r 

where 

A = 0.64r* B.u. (10.41) 

The screening function is thus of the exponential type, which was shown 
above to be very similar to the screening function of the plasma theory. 
However, we shall see that the screening distance A, 0.68 A for sodium, is 



Fig. 10.1. The full curve represents the 
function F(/c c r), equation (10.37), and the 
broken curve the function exp (—k c r). 
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about one half the value of k ~ 1 required by the plasma theory, so that the 
Thomas-Fermi method rather exaggerates the screening effect of the 
electron gas. 

10.6. The Exchange Energy due to the Short-Range Interaction, and the 
Density of States 

The deficiencies of the Hartree-Fock theory have been discussed at some 
length in Chapter 7. This theory takes into account the ‘accidental’ correla¬ 
tions among the positions of electrons with parallel spins, due to the Pauli 
principle, but completely neglects proper Coulomb correlations. As a 
result, ‘exchange’ effects are very much exaggerated, and, in particular, the 
density of states curve is much too wide and has the wrong shape near 
the Fermi surface [see Figure 7.8]. Now, according to the theory developed 
in the present chapter, the effects of long-range Coulomb correlations, 
among the positions of all electrons, regardless of spin, are embodied in the 
plasmons. Furthermore, these plasmons are not normally excited and, 
for the purpose of calculating the density of one-electron states in energy, 
they may be ignored, except in so far as they permit us to dispense with 
the long-range part of the Coulomb interaction. We thus expect that the 
difficulties of the Hartree-Fock theory will be largely overcome if the'residual 
short-range interaction is used in calculating the exchange energy, instead 
of the whole Coulomb interaction as before, because this is equivalent 
to taking into account long-range Coulomb correlations before applying 
the Pauli principle. 

The exchange energy associated with the state k in the Hartree-Fock 
theory, from equation (7.36), et seq., is 



(10.42) 


the sum being over all the orbital states within the Fermi sphere. In order 
to find the exchange energy associated with the state k in the Bohm-Pines 
theory, which we will denote by e xs (/c), we must substitute the expression 
(10.35) for e 2 Jr in (10.42), and also set v = 1, since we are considering 
unit volume. This gives, with suitable adjustment of notation to avoid 
confusing the various k vectors, 



*1 k 2 

(|fc,|<fc 0 ) (|fc 2 |>ICc) 
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The integral is throughout the unit cube, so that 

e i ( k-k l+ k 2 ) r dr = 0j jf k 2 ^k t -k, 

= 1, if k 2 = k x -k. 
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(10.44) 


and, hence, 


fixs(fe) = £ 


1 


tt \kt-kV 

(|kif <k 0 , 

|ki-k|>fc 0 ) 


(10.45) 


Substituting an integral for the sum, in the usual way, we find 


sU 2n 2 J\k 1 -k\ 2, 


(10.46) 


the region of integration being defined by 

|fcj| < /cq, | k± k\ > k c . 

In evaluating this integral we consider the two cases, k < k 0 —k c> 
k > k 0 —k c , separately. It is convenient in both cases, however, to use the 




W (b) 


Fig. 10.2. The region of integration for k x lies inside the Fermi 
sphere (large circle) and outside the shaded region, which, in case (a), 
represents a sphere of radius /c c , centred at A, and, in case (b), 
represents a portion of such a sphere. 


cylindrical coordinates (p, </>, z) with origin at k and z-axis in the direction 
of k, p being the perpendicular distance from k t to the z-axis, as shown 
in Figure 10.2. We then have 

\k t -k\ 2 = p 2 + z 2 . 


(10.47) 
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Owing to the cylindrical symmetry the azimuthal angle 0 need not appear 
explicitly, and we may take the volume element to be 2npdpdz. 

The expression (10.46) thus assumes the following forms: 


for k < k 0 — k c , 


2 pko-k pi 

s(*)=-- dz 

71 J ~(k + k 0 ) J 0 


[k 0 z-(z + kp]i- 


pdp 


2 . 2 
p + Z 


+ 


' r dz r 

71 J -fee 


(k c 2 -z 2 )i 


pdp 

p 2 + z 2 


for k > k 0 —k c , 

,2 r-k c 


e 2 r kc f c 
Ik) =- e - dz 

n J-(k + ko) Jo 


[k 0 2 -(z + kp]i 


pdp 

2 , 2 

p +Z 


e l[ 

n J _ 


2 /'(feo 2, — fe c 2 — fe 2 )/2fe /'[feo 2 -(z + fe) 2 ]^ 


dz 


r Lfeo z - 

J ft 2 -, 


pdp 


' (kc 2 -z 2 )± p 2 + Z 2 


The integrations are all elementary, and we obtain, finally, 
for k < k 0 —k c , 

£ xs(fc) = - f \2k 0 + log fe±b) -4k c ] , 

271 L k \k 0 -k/ J 


for k > k 0 —k c , 


£ U k ) = ~ — 

2n _ 


" kl — k 2 
k 0 H-log 


ft?) 


3k 2 + k 2 -k 2 

+ c 


(10.48) 


(10.49) 


(10.50) 


2k 


--2fe c J. (10.51) 


It should be noted that, in obtaining these expressions, we have assumed, 
as we shall continue to do in the following work, that k 0 > k c . 

The total one-electron energy e(k ) is obtained by adding the free-electron 
kinetic energy to e„(k), that is, 


h 2 k 2 

<*) = +^xs(fc). 

2 m 


(10.52) 


This is still a function of the magnitude k of the wave vector only, so that 
the energy suifaces remain spherical and the density of states is given by 
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equation (7.32), namely, 


J^(s) = —- — 
2n 2 ! d k 


(10.53) 


for unit volume. It can be seen immediately that^4 / '(e) does not fall to zero 
at the Fermi surface, as it does in the Hartree-Fock theory. Near the Fermi 
surface we must use equation (10.51) for e xs (fc), which gives 


d6 


_ h 2 k | e~ 
d k m 2 k 


2 r /,2 


K + k 2 


log 


fir) 


+ 


k 2 c —kl — 2 kk 0 — k 


2-1 


2 k 2 


(10.54) 


This does not tend to infinity as k k 0 , but to the finite value 


h kp 

2n 


m 


- [21og(^ + - 2 ] . (10.55) 

nl \kj 2kl 1 

It is convenient to introduce the dimensionless constant /?, defined by 

P = kjic 0 , (10.56) 

so that 

(—) =^ + -(2 log-+£-2). (10.57) 

\dk/ ko m 2n\ |3 2 / 

Now, in the Sommerfeld-Hartree theory only the first term on the right 
appears, the other being due to the exchange energy. Thus, if we denote 
by (e) and yF’(e) the densities of states in the Sommerfeld-Hartree and 
Bohm-Pines theories respectively, and by Co an d Ci the respective energies 
at the Fermi surface, we have, from (10.53), 


•^o(Co) 


= 1 + 


e 2 m 


- (2 log - + — — 2 ) . 
o'. IS 2 J 


^(Ci) 2nh 2 k 

Using formula (7.31), which gives 

k 0 = 1.919/r,, 

and measuring r s in Bohr units, (10.58) becomes 


(10.58) 


(10.59) 


2 Clo(Co) = 1+ ( 2 log- + £ - 2 ) . (10.60) 

yT(Cx) 12.06 \ 0 2 1 

We have not yet discussed in detail the evaluation of /?, although a rough 
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value is provided by equation (10.23). There is, in fact, no unique method 
of fixing this quantity precisely, but we shall see in § 10.8 that a reasonable 
value for sodium is about /? = 0.8. This, with r s = 4 B.u., gives 


- ^ (Co) = 1.05. 


(10.61) 


The density of states at the Fermi surface thus has very similar values in 
the two theories, and the same applies to any quantity which depends only 
upon this density. In particular, (10.61) is also the ratio of the calculated 
electronic specific heats in the two theories, so that these are very similar, 
and, furthermore, the difficulty regarding the temperature dependence of the 
electronic specific heat in the Hartree-Fock theory, discussed in § 7.7, no 
longer appears. 

Now let us investigate the band width according to the Bohm-Pines 
theory. From (10.50), (10.51) and (10.52), we obtain 


and 


K k o) 


2m 



2 — 2/1 + 



£ (0) = - C -^(4-4,8), 

2 n 


(10.62) 


(10.63) 


provided, in the latter case, that /J < 1, or k c < k 0 , as we have assumed. 
The band width is thus 


s(k o )-s(0) = (—2+20+ £■) . (10.64) 

2m 2n \ 2 / 

Again using equation 00.59), and atomic units, this becomes 

e(k 0 )-e( 0) = ^ - I-2 + 2/3 + —) ryd. (10.65) 

The first term on the right is the Sommerfeld-Hartree band width, and 
amounts to 0.23 ryd for sodium. If, as before, we take = 0.8 for this 
metal, the value of the second term is 0.01 ryd, so that the Bohm-Pines 
band width is practically the same as the Sommerfeld-Hartree band width. 
Indeed, the two are equal when /? = 2(^/2-1) = 0.828, since the second 
term on the right of (10.65) vanishes for this value. The Bohm-Pines band 
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width is thus in much better agreement with the experimental X-ray data, 
as discussed in § 7.9, than is that of the Hartree-Fock theory. 

We have now established that the essential features of the Bohm-Pines 
and Sommerfeld-Hartree ^V(e) curves are very similar. Of course, the 
detailed comparison of the two curves — whether one is lower and wider 
or higher and narrower than the other — depends upon the chosen value 
of (3, and there is a certain latitude here. None the less, whatever the value 
of (3, it is quite clear that the use of the short-range interaction, as justified 
by the plasma theory, overcomes the principal defects of the Hartree-Fock 
theory. 

It should be noted that the Bohm-Pines density of states, coming from 
(10.52) and (10.53), still neglects the effect of short-range Coulomb correla¬ 
tions. If we regard the short-range interaction as a perturbation applied 
to a system of initially non-interacting particles, the exchange energy 
e xs (k) is the first-order perturbation energy — the second-order energy 
would represent the effect of short-range correlations. Fletcher and 
Larson [1958] have calculated the latter and have found that, again dependent 
upon the value of (3, it can have a considerable effect upon the^F(e) curve. 
The qualitative results we have obtained above remain true, however. 

In the following section we shall be concerned with the total energy 
of the system, and, in preparation for this, we will now calculate the total 
exchange energy due to the short-range interaction. We will denote this 
by nE xs , where E xs is the average exchange energy per electron, and as in 
the Hartree-Fock theory, equation (7.49), we find 

»B„ = E e„(k). (10.66) 

k 

The sum is over all the orbital states within the Fermi sphere, and, replacing 
the sum by an integral, we obtain 

- pi = i?\}" (k)k2dk 

= -L f"'%„(*)fc 2 d/c+ — 2 f e x ,(k)k 2 dk. (10.67) 
2tt 2 J 0 2tt J ko - kc 

We have divided the range of integration with respect to k into two parts 
— e xs (k) is given, in the first, by (10.50), and, in the second, by (10.51). 
Nothing worse than elementary integration by parts is involved, and the 
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result is 


3e*k 0 n / 4/? | /? 2 /? 4 


( 


:) 


( 10 . 68 ) 


4ti V 3 2 48 


This gives the average exchange energy per electron, in atomic units, as 



(10.69) 


Now, the average exchange energy in the Hartree-Fock theory is — 0.91 6/r s 
(it is obtained from the Bohm-Pines result by setting /? = 0). If we take 
P = 0.8, for sodium, the factor 



has the value 0.26, so that the Bohm-Pines exchange energy is considerably 
less than that of the Hartree-Fock theory. We have seen in Chapter 7 that 
the principal advantage of the Hartree-Fock theory over the simple Hartree 
theory lies in the lower total energy produced by the large negative 
exchange energy. It would therefore appear, at first sight, as if the Bohm- 
Pines theory were energetically less favourable than the Hartree-Fock 
theory. We shall see in the following section that this is not so, however; 
for the smaller exchange energy is compensated by other energy terms, 
so that the total energy in the Bohm-Pines theory is, in fact, lower than 
that in the Hartree-Fock theory, even when the former does not include 
the effect of short-range correlations. 

10.7. The Correlation Energy 

It is difficult to make the calculation of the total energy of the system 
entirely convincing without going into details of the quantum-mechanical 
theory of plasma oscillations, which we do not wish to do. Not only would 
this take us beyond the scheme of the present work, but, owing to the 
fact that the quantum-mechanical theory is still in some dispute, it would 
probably raise more difficulties than it would resolve. None the less, we 
believe that the essential features of this calculation can at least be made 
plausible within our prescribed framework. 

According to the plasma theory, the system, which consists of n electrons 
and a uniform distribution of positive charge in a box of unit volume, can 
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be described approximately as a set of n particles having a screened Coulomb 
interaction, together with a set of plasmons, one for each k with k < k c . 
The number of plasmons is thus kl/6n 2 , the number of k vectors lying 
within a sphere of radius k c , the density being 1 /87t 3 .-In the ground state 
of the system, and it is the energy of this state we wish to find, the plasmons 
are all in their ground states, each having zero point energy ^h<x> p , and 
are represented by a product of ground state harmonic oscillator wave 
functions. Also, the screened interaction of the particles is of such short 
range that as a first approximation we may neglect it, in which case the 
particle wave function is a single determinant of free-electron functions 
including spin. 

The ground state wave function of the system is thus approximately a 
product of ground state harmonic oscillator functions and a single deter¬ 
minant of free-electron functions. This is only an approximation for two 
main reasons: first, we have neglected a small coupling which exists between 
the plasmons and the electrons, and, second, we have neglected the effect 
of the short-range interaction term i/ s r . on the particle wave function. 
A further difficulty may be noted here. The number of degrees of freedom 
of a system of n free par,tides is 3 n. By re-describing the system in terms 
of plasmons we have increased this number by kl/6n 2 , which is not per¬ 
missible. It is, in fact, necessary to impose a number of so-called subsidiary 
conditions, equal to the number of plasmons, upon the system in order to 
maintain the number of degrees of freedom at 3 n. These conditions, which 
express the relationship between the plasmon and particle coordinates, 
complicate the formal quantum-mechanical problem considerably. However, 
Bohm, Huang and Pines [1957] have shown that, as far as the calculation 
of the ground state energy of the system is concerned, the subsidiary con¬ 
ditions can be neglected, and so we will ignore them. 

We therefore expect the approximate total energy of the system to be a 
sum of the kinetic energy of the particles, the exchange energy due to the 
short-range interaction, and the zero point energies of the plasmons. 
This would, indeed, make the energy higher than that of the Hartree-Fock 
theory, but an important negative energy term has, in fact, been over¬ 
looked. The necessity for this term may be understood by reference to 
formula (10.33) for the potential energy of the whole Coulomb interaction. 
When the potential energy of the short-range interaction (those terms 
with k > k c ) is removed from this, the remainder may be expressed in 
the form 
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(10.70) 


Now, we have remarked that the density fluctuations p k , with k < k c , 
are described by the plasmons, so that the energy due to the first term of 
(10.70) is already included in the zero point energies of the plasmons. This 
does not apply to the second term, however, which is constant and must 
appear unchanged in the final energy expression. 

If we denote by E BP the average energy per electron in this approximation, 
the total energy of the system is, then, 


nE BP 


nE F + nE„+ 
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(10.71) 


on replacing the sum by an integral in the usual way. Using equations 
(10.56), (10.68) and the fact that k\ = 37 i 2 n, this gives 
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3 e 2 k 


4n 


\ 3 2 48/ 4 n 

2(i + £!-^\ + K^ 

\ 2 48/ 4 


(10.72) 


From equations (7.52), (10.26) and (10.59), we then have, in atomic units, 
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(10.73) 


The first two terms in (10.73) constitute the average energy per electron 
^hf in the Hartree-Fock theory [see equation (7.54)]. If we take /? = 0.8 
and r s = 4 B.u. for sodium, as in the previous section, the remaining three 
terms in (10.73) have the value -0.016 ryd, so that £ BP is lower than F hf . 
The reason for this lower energy is simply that the Bohm-Pines theory 
takes into account long-range correlations among the electronic positions, 
regardless of spin, while the Hartree-Fock theory does not. The difference 
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Zsgp — -£ hf is called the long-range correlation energy per electron, and we 
will denote it by W x ; thus, 


W t = ~ 


0.458 -2 0.866 o3 

- P 2 + —r-p + 


0.019 o4 


/T.ryd. 


(10.74) 


i 

This is not the whole of the correlation energy, as defined in §9.5, because 
£g P still neglects the effect of H s r on the particle wave function — in other 
words, the effect of short-range correlations. The term nE xs is the first-order 
perturbation correction due to H it _, obtained by using a single determinantal 
particle wave function, which would be correct only if H s r were zero. 
The correct particle wave function-should be an infinite sum of determinants 
of free-electron functions, which are the eigenfunctions of the unperturbed 
system, with coefficients determined by i/ s . r .. We have already remarked 
in § 9.5 that, if the whole Coulomb interaction term (10.30) is treated as a 
perturbation, the second-order energy correction diverges. Here, however, 
because the screened interaction is of such short range, the second-order 
perturbation energy turns out to be quite small, and it is not necessary 
to consider higher-order corrections. We call this second-order perturbation 
energy, which is of course negative, the short-range correlation energy and 
denote it per electron by fV s . 

From equation (3.134) we thus find 


W. = - 1 E 

n j ±o 


l(H s , r ,) 0J | 2 

Ej-E 0 


(10.74) 


where 

- J <H.,. S',dr'. 


(10.75) 


the normalized functions W 0 and Vj being nth-order determinants of free- 
electron functions, the former representing the ground state and the latter 
an excited state of the unperturbed system. E 0 and Ej are the corresponding 
kinetic energies, that is, the energies of the system of particles in the states 
T 0 and Wj when no interaction is present. 

The calculation of the matrix elements (10.75) is too lengthy for inclusion 
here. However, it may be shown that the only non-zero matrix elements 
are those for which *Fj contains two and only two excited one-electron 
functions. That is to say, Vj must be obtained from *P 0 exciting two 
electrons lt> states lying outside the Fermi sphere. These excited electrons 
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can have either parallel or antiparallel spins, but it is found that the matrix 
element is zero unless the spins of the excited states are the same as those 
of the initial states. Now, the terms in (10.74) corresponding to the excitation 
of two electrons with parallel spins give the part of the short-range correla¬ 
tion energy due to parallel spin correlations and the remaining terms give 
the part due to antiparallel spin correlations. We know, however, that the 
Pauli principle has the effect of keeping electrons with parallel spins apart, 
reducing the effectiveness of H S T , so it is reasonable to assume that the 
parallel spin part of W s will be small, and this is borne out by calculation. 
For simplicity, therefore, we will ignore the contribution of parallel spin 
correlations to W s . 

The only Wj we need consider, then, are of the type obtained from !P 0 
by raising two electrons with wave vectors k t and k 2 , and opposite spins, 
to excited levels lying outside the Fermi sphere. It also appears in the 
calculation that the wave vectors of the excited states must be k^ + k and 
k 2 — k, with k > k c , so that the total momentum is automatically conserved. 
We thus find 

(tfs.r.)o J = 4ne 2 lk 2 , (10.76) 

Ej-E 0 = -^-[(fe 1 + fc)%^(fc.._fc) 2 -fe2_ fe 2] 


= — fc • (fej -k 2 + k), 
m 

and, hence, 

W = _ i V V V m(47ie 2 ) 2 

n k ft, k 2 h 2 k 4 k • (k t — k 2 + k) , 


( 10 . 11 ) 


(10.78) 


where k > k c , and k { , k 2 lie inside the Fermi sphere while k x +k and 
k 2 — k he outside it. 

If we now replace the sums by integrals, we obtain 


m(4n e 2 ) 2 fff dkdk l dk 2 

nh 2 (Sn 3 ) 3 J J J k A k • (k { — k 2 + k) ’ 


the region of integration being 


(10.79) 


l^il> 1^1 < k > K, Ifcj+fel > k 0 , \k 2 — k\ > k 0 . 

Finally, expressing k, |fc 1 | and |fe 2 | in units ofk 0 and the energy in rydbergs, 
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we have 


J_fff dtdt.dfc, _ fyd 

167i 5 JJJ k 4 k • (fci — k 2 + k) 


(10.80) 


where | k t \, \k 2 \ < 1, k > P, \k t + k\ > 1, \k 2 -k\ > 1. The integration is 
tedious and we merely quote the result given by Pines [1955]: 

W s = — (0.0254 — 0.0626 log p + 0.006 37/? 2 ) ryd. (10.81) 

The total correlation energy per electron, the sum of W x and fV s , is thus 


^ = _^ 2+ ?^V + o^iv 

r s r t r s 

-0.0254 + 0.0626 log p- 0.006 37j5 2 ryd. (10.82) 

Before applying this to any metal, let us consider in more detail the evalua¬ 
tion of p. 


10.8. The Value of P 

Pines [1955] originally proposed to choose that value of the parameter 
p which minimizes W x , or, what is the same thing, the value which minimizes 
^p. This is a reasonable procedure, since it gives the greatest improvement 
of the Bohm-Pines total energy on that of the Hartree-Fock theory, without 
taking into account short-range correlations — in other words, it takes 
the greatest advantage of the inclusion of long-range correlations in the 
plasma theory. Neglecting the small term 0.019/? 4 /r s in (10.74), we find 

(10.83) 

dp r s r t 

and equating this to zero gives 

P = 0.35r±, (10.84) 


or 


k c = 0.35= 0.67r, *, 


(10.85) 


where r s is in Bohr units. According to this formula, P - 0.70 for sodium, 
and the screening distance is k c 1 = 3.0 B.u. = 1.6 A. 

If the energy due to the interaction between the plasmons and electrons, 
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which we have neglected in our calculation, is included in W, before minimi¬ 
zation (see Pines [1958]), a slightly larger value, roughly 

p = 0.40 r* (10.86) 


is obtained. This gives P = 0.80 for sodium, which is the value we used 
in § 10.6. 

As far as the calculation of the density of states is concerned it is clearly 
advantageous to have ft as large as possible, for this will reduce the short- 
range interaction to a minimum and so improve the accuracy of the in¬ 
dependent particle approximation. However, it is not possible to choose 
an arbitrarily large value of ft. Equation (10.23) suggests, and this is borne 
out by quantum-mechanical considerations, that an upper bound to the 
value of P is 


Now 


and 


so that (10.87) becomes 


or 


p = 

k 0 v 0 

(10.87) 

h 2 k 2 

" K 0 1 2 
=-= i*nv 0 , 

2 m 

(10.88) 

i 

ii 

o 

(10.89) 

ft _ ftc °p 

2(o ’ 

(10.90) 

P = 0.47 rj, 

(10.91) 


from equation (10.29). This gives 0 = 0.94 and k~ l = 1.2 A for sodium. 

If we insert any of these values of 0 in equation (10.82) and retain only 
the significant terms for actual metallic densities (that is, for values of r s 
lying between 2 and 6 B.u.), we obtain 


W = C+0.031 log r s , (10.92) 

approximately, the constant C having a value in the neighbourhood of 
— 0.1. Nozieres and Pines [1958] have considered the correlation energy 
in more detail, taking into account the interaction between plasmons and 
electrons, and have concluded that, with a probable error of less than 15 %, 

W = —0.115 + 0.031 log r s (10.93) 
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in the region of actual metallic densities. This is the formula quoted in 
§ 9.5; it gives W = —0.072 ryd for sodium, in good agreement with the 
value — 0.075 ryd given by the formula (9.82) of Wigner. 

It is of interest to note that the value of /? according to the Thomas- 
Fermi theory, if we take k ~ 1 to be the screening distance X given by equation 
(10.41), is 

jS TF = 0.81r* (10.94) 

which is nearly twice as great as the largest value permitted by the plasma 
theory. 

10.9. Experimental Evidence of Plasma Oscillations in Solids 

So far the validity of the plasma theory has rested upon its ability to 
justify the independent particle approximation, which we know to be in 
reasonable agreement with experiment, and to provide a value of the 
correlation energy of the size required in cohesive energy calculations. 
More direct evidence of the existence of plasma oscillations in metals is 
provided by the energy losses of fast electrons in penetrating thin metallic 
films. 

It has been shown in § 10.4 that plasmons are unlikely to be excited by 
heating, since even at quite high temperatures no electron has thermal 
energy hco p to give away. However, if a very fast electron enters a metal 
from outside, there is a good chance that it will excite one or more plasmons. 
Thus, if a beam of high energy electrons is shot through a metallic film, 
and we assume that the electrons in the metal are practically free, we should 
expect some of the emergent electrons to have lost energy h(o p , others 
2heo p , others 2>hco p , and so on, according to the number of plasmons they 
have excited. Of course, even in a free-electron gas, the energy losses will 
not be in multiples of exactly h(o p , owing to dispersion [see equation 
(10.3)], but this will merely broaden the spectral lines slightly. 

In fact, many experiments have now been carried out (for a summary 
of the experimental results see Pines [1955] or [1956]) in which beams of 
electrons with energies of several kilovolts are passed through metallic 
films, generally of the order of 100 A in thickness, and the energy loss 
spectra determined. Such high energies are not strictly required in order to 
excite plasmons, since hco p is in the region of 10 eV, but they are used in 
order that the electrons will be scattered through only a small angle in 
passing through the film, which makes the emergent beam easier to handle. 
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It is found that, for some metals, in particular beryllium, magnesium, and 
aluminium, there are cha¬ 
racteristic energy losses at 
almost exact integral mul¬ 
tiples of hco p , the electron 
density used in computing 
(O p being that of the valence 
electrons in the metal, that 
is, two per atom for Be and 
Mg and three per atom for 
Al. The energy loss spectrum 
of aluminium, found by 
Blackstock, Ritchie and 
Birkhoff [1955], is shown 
in Figure 10.3; h(n p is 16 eV 
for this metal and peaks 
appear at intervals of very 
nearly 15 eV. However, the 
energy loss spectra of many 
other metals, and several 
non-metals, such as carbon, silicon and germanium, show at least one 
pronounced peak at about h<o p , where again the density of valence electrons, 
four per atom for C, Si and Ge, is used in'computing <o p . 

On the other hand, agreement between the free-electron plasma theory 
and the experimental loss spectra is completely lacking for metals such as 
copper, silver, gold and the transition metals with well-filled d sub-shells. 
Copper, for instance, has a single broad peak at about 2hco p , when the 
electron density is taken to be one per atom. This is not very surprising, 
for where the ion-cores are large the distinction between valence and core 
electrons tends to break down, and we should expect a direct participation 
by the core electrons in the plasma oscillations. The core electrons, even 
in the d sub-shells of the transition metals, are by no means free, however, 
so that we should not expect the free-electron plasma theory to apply in 
this case. 

Even in metals such as sodium, magnesium and aluminium, in which 
the ion-cores are small, the periodic field of the ionic lattice is expected to 
have some effect on the plasma oscillations, for it causes the one-electron 
wave functions to differ in some degree from free-electron functions. It 



Fig. 10.3. Energy loss spectrum of 45 keV electrons 
after passing through an aluminium film of thick¬ 
ness about 550 A. Peaks occur at intervals of 
approximately 15 eV, while ficop for this metal is 
16 eV (after Blackstock, Ritchie and Birkhoff 
[1955]). 
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might be imagined that the plasma frequency in an alkali metal, for example, 
could be obtained quite simply by substituting the effective mass m*, dis¬ 
cussed in § 9.3, for the ordinary electronic mass m in the formula for co p , 
According to a theory due originally to Mott (see, for example, Raimes 
[1957]), however, this is not so, and the plasma frequency in such metals 
should rather be simply the free-electron frequency, with a small correction 
for the polarizability of the ion-cores. 

This theory, which is now known as the dielectric theory of plasma 
oscillations, has been extensively developed by a number of people. It 
promises to be a powerful means of studying the effects of electronic inter¬ 
actions in solids, but so far little progress has been made owing to lack of 
the necessary experimental data. However, one of the elementary results 
of the theory is that the frequency-dependent dielectric constant l+4rox, 
where a is the polarizability of the metal, must vanish at the plasma 
frequency. Unfortunately, optical data in the relevant frequency range, 
the far ultra-violet, exist only for the alkali metals, but for these metals 
it is indeed found that the dielectric constant vanishes, that is to say, the 
metal becomes transparent, for light of almost exactly the angular frequency 
co p . This is further confirmation of the applicability of the free-electron 
Dlasma theory to such metals. 

In conclusion, we would emphasize once more that the theory we have 
described in this chapter is not free from difficulties, and it may prove to 
be quantitatively unreliable without many tedious refinements which would 
detract from its elegance and practical utility. None the less, even if only 
qualitatively valid, it will have served a most useful purpose in providing 
a long-needed justification for the independent particle approximation. 
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CHAPTER 11 


TIME-DEPENDENCE AND TRANSITION PROBABILITIES 

11.1. The Time-Dependent Schrodinger Equation 

We have reserved a discussion of the time-dependence of states until 
this final chapter, not because it is more difficult than the previous work, 
nor because a knowledge of all that has gone before is required for its 
understanding, but simply in order to emphasize that we have so far been 
dealing only with stationary states , whose dependence on time is trivial. 
We have, in fact, only considered conservative systems, whose Hamiltonians 
do not contain the time explicitly. However, a problem which frequently 
arises in atomic physics is that of a system, initially in a given stationary 
state, which is subject to a perturbation of only temporary duration or 
varying with time. This might, for example, be due to the passage of a 
charged particle or a light wave through the system. There is then a finite 
probability that in a certain time the system will have made a transition 
to some other stationary state. This is called a transition probability, and 
in order to calculate it we need to use a modified Schrodinger equation 
which applies when the energy of the system is not necessarily constant. 

The form of the required equation is suggested by considerations like 
those introduced in § 2.1. The classical Hamiltonian H and energy E are 
now both functions of the time t, so that the energy equation is 


H{q, p, t) = E(t) 


( 11 . 1 ) 


(for compactness we use q and p to denote the whole set of cartesian coor¬ 
dinates and conjugate momenta of the system). Let us define the function 
by 


^(q,p, t, E) = H(q, p, t) - E. 


It can immediately be verified that JT may be used as the Hamiltonian of 
the system, for 


dJe dH 

Op Op 



op 


310 


(11.2a) 
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and 


b3/e dH 



(11.2b) 


— in other words, Hamilton’s equations have the same form whether 
or H is used. However, we now have the two further equations 


and 


dJ? . . , 1M 

— =-l = *, (11.3a) 

dE 

dJf _ dH 
dt dt 


The latter equation may be written, using (11.1), 

(11.3b) 

d(-t) 

Comparing (11.3a) and (11.3b) with (11.2a) and (11.2b) respectively, we 
deduce that E is conjugate to -t. Now, in obtaining the Schrodinger 
equation from the classical energy equation, we replace the momentum p 
conjugate to q by the operator -i h d/dq. The foregoing suggests that E 
should also be replaced by the operator -i h d/d(-t); that is, we should 
write 

£ = ift - . (11.4) 

dt 


Substituting in equation (11.1), and operating with both sides upon 
a function t W(q, t), we obtain 



which is the time-dependent Schrodinger equation . The function W must, 
of course, satisfy the conditions imposed upon \j/ in § 2.2. 

For a system of N particles with a potential energy V which depends 
explicitly upon the time, the equation is thus 


t Throughout the present chapter we shall use W to denote a function of the particle 
coordinates and the time, and y) to denote a function of the particle coordinates 
only. 
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N i.2 


h 

- Z - Vf + V(x u y t , z„ ...x Nt y N ,z N , 0 

; = i2m ; 


^(*i> • • • Zn, t) 


= ih ^ W(x u ... z N , t). (11.6) 

ot 


It should be emphasized that we have not given a rigorous derivation of 
the Schrodinger time-dependent equation from classical mechanics. No 
such derivation exists, any more than it does for the time-free equation 
considered previously. Nor is a derivation necessary — it is sufficient to 
postulate equation (11.5) as one of the basic equations of wave mechanics. 
We have merely been concerned to remove a little of the arbitrariness from 
such a postulate, but, as always, the justification lies in the results. 

Of course, equation (11.5) must remain valid even when we are dealing 
with a conservative system. In this case H does not contain t, and the equa¬ 
tion is separable. We write 


substitute in (11.5), and divide through by 7', obtaining 

+ f it 


( 118 ) 


The left-hand side is a function of q only and the right-hand side a function 
of t only, so that both must equal the same separation constant X. Thus, 

H\p = X\jt. (11.9) 


But this is just the ordinary (time-free) Schrodinger equation, and X must 
therefore be one of the energy eigenvalues, E n say, and i j/ the corresponding 
eigenfunction i j/ n . The equation for / is then 


with the general solution 


Id/ i , 

-=- , 

fdt h 

/(f) = Ae- iEnt, \ 


( 11 . 10 ) 


( 11 . 11 ) 


where A is an arbitrary constant. A separable solution of (11.5) for a 
conservative system is therefore 


¥ n {q, t) = AiJ/ n (q)c iEnt, \ (11.12) 


and, since the equation is linear, the general solution may be expressed as 
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a sum of such solutions for all the states t {/„; that is, 

n?,') = ZA«9)e-' £ -''*, (U.13) 

n 

the coefficients A n being constants. When t = 0, we have 

¥(4, 0) = E A n'l'n(<l), 

n 

and, if the i j/„ are normahzed as well as orthogonal, we find by the method 
of Appendix 2 

A n = J^*(q)^(q, 0)dq, (11-14) 

the integration being over the configuration space of the system. 

In the general case of a non-conservative system, let us suppose that 
\J/ n and E n are the eigenfunctions and corresponding eigenvalues of the 
equation 

H 0 \jj = Exl>, (11.15) 

where 



the Hamiltonian at time t = 0 (we can choose t = 0 to be any convenient 
instant). Then the general solution of equation (11.5) may be written 

n.q, t ) = E A n (t)Uq>~ iEnt '^ (11.16) 

n 

where the coefficients A„ are now functions of t. This is because, at any 
fixed time t, it is always possible to expand the function T{q, t) in terms 
of a complete orthogonal set of functions of q, but the coefficients will, 
of course, depend upon the value of t. The exponential factors in (11.16) 
are really superfluous so long as the A„(t) are not specified, but are included 
so that, in the case of a conservative system, the expansion reduces to 
(11.13) with A„ constant. 

11.2. Probability Density and Expectation Values 

If W is normahzed according to 

j\W(q,t)\ 2 dq = 1 , (11-17) 

then \^\ 2 , or X F* X F, is interpreted as a probability density. In other words, 
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|!F(<7, t)\ 2 dq is the probability that the configuration of the system will 
lie in the small interval dq about q at the time t. 

It is implicit in the normalizing equation (11.17) that the integral of 
\T(q, t')\ 2 over the configuration space of the system must be a constant, 
independent of t. This is easily seen to be true for the function x P n given 
in equation (11.12), but we will now prove it to be so in general. The Schro- 
dinger equation and its complex conjugate are 


dV 

(11.18) 

dV* 

i h . 

dt 

(11.19) 


Multiplying the former by and subtracting the latter multiplied by 


!P gives 



i h 

BY dY* d 

V*— + V = (¥*¥). 

dt dt dt 

(11.20) 

Hence 




^(y*H'¥-'¥H*'¥*)dq. 

(11.21) 


However, owing to the Hermitian character of H [see equation (3.58)], 


J 'P*H'Pdq = J q s 

(11.22) 

so that 


sJ " °’ 

(11.23) 

and this is what we wished to prove. 



The expectation value </> of an operator/^,/?) is defined as in §2.7 by 


</> -J »>*(«. «)/(«.“) (U.24) 

when Y is normalized. 

For a conservative system in the state given by equation (11.13) the 
expectation value of the energy is 
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<H> = J ¥>*(«, 1)H («.“) n«. <)d9 

= f (I | (I A m 4, me -‘ E -'‘)iq 

J n Ot m 

= EE4W™ f 

n m J 

= E M„l 2 £„ (11.25) 

n 

assuming that the i//„ are normalized as well as orthogonal. This formula 
is only true provided IF is normalized, that is to say, 

f | ¥(q, t)\ 2 dq = £ £ A*A m e i(£n_£m)r/ * f ils*^ m dq 
J n m ^ 

= ZM,I 2 = 1- (11-26) 

n 

Now, a measurement of the energy of the system must yield one of the 
energy eigenvalues E„. We therefore deduce from (11.25) and (11.26) that 
\A n \ 2 is to be interpreted as the probability that the measured value of the 
energy will be E„. 

For the particular state 

!?.(«,<) = «4)e-' E -''* (11-27) 

of a conservative system, we note that the energy is certain to have the 
value E„. Also 

\^n(g,t )\ 2 = \Ug)\ 2 , ( 11 - 28 ) 

so that the probability density does not change with time, and the above 
probability interpretation of |*F| 2 reduces to the one we used earlier in the 
book. Such a state is what we have referred to as a stationary state. Since 
it is |!F| 2 and not !F itself which has physical meaning, it is clear that so 
long as we only consider stationary states the time-dependence is not 
significant. 

11.3. The Current Density 

Let us for simplicity consider a single electron moving in an electric field, 
with potential energy V(r). The Hamiltonian is 

H = — — V 2 + F(r), 

2m 


(11.29) 
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and equation (11.20) becomes 

— |»P| 2 = - — (<F*V 2 'F-'FV 2 '¥*). (11.30) 

dt 2mi 

This is the same as 

-\<F\ 2 = - —div('F*V'F-'FV'¥*), (11.31) 

dt 2mi 


since 

V • (f/*V!P) = VW* ■ V'F + 'F*V 2 'F. 

It will now be convenient to use the charge cloud interpretation of | 'F\ 2 , 
introduced for \\J/\ 2 in § 2.6, which is closely related to the probability 
interpretation. Thus, we assume that the charge -e of the electron is 
spread out into a charge cloud of density 

P=-e\'F\ 2 , (11.32) 

which in general is a function of position and of time. The electric current 
density j is then related to p by the equation of continuity 

-£ = - divj, (11.33) 

dt 


which merely expresses the fact that the rate at which charge is emerging 
from a volume element must equal the rate at which the charge within the 
element is decreasing. Comparing (11.331 with (11.31), we find that 

eh 

j = -(‘P'VI'-S'VS'*). (11.34) 

2mi 

If dS is a small plane element of area and j n is the component of j per¬ 
pendicular to the element, then j„ d*S is the amount of charge passing through 
dS per unit time. 

Of course, it is not necessary to use the charge cloud interpretation. 
If we omitted the charge -e from (11.32) and (11.34), then j would be the 
probability current density. On the other hand, if we substituted m for -e, 
then j would be the mass current density. 

Since j = pv, where v is the mean velocity of charge or mass at any 
point, we have 

h 

w =-(5'*V1'-'f'V'f'*)/|»F| 2 . 

2mi 


(11.35) 
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We note that, unless a state W is a complex function of the spatial coor¬ 
dinates, there is no current associated with it. For example, the eigen¬ 
functions of a free particle moving along the x-axis are 


«*) = e lh , 

(11.36) 

with unquantized energies 

(]A h 2 k 2 
e(k) = _ . 

(11.37) 

2m 

The time-dependent wave function of a stationary state is, 

thus, 

W k (x, t) = e ift *e- ie(Wfi . 

(11.38) 


This is not normalized, but none the less may be substituted in equation 
(11.35), since normalization would multiply the numerator and the denomi¬ 
nator by the same constant. We find 



= —. (11.39) 

m 

For a particle in a one-dimensional box, however, with eigenfunctions 

\j/ k (x) = sin kx, (11.40) 

and the same time factor as in (11.38), we find v = 0. 

For a stationary state of any conservative system, with wave function 
of the form 

V{<1> 0 = (H- 41 ) 

it is easily seen that 

v = — (iA*ViA-tAV^*)/|t//| 2 . (11.42) 

2mi 

11.4. The Average Velocity Associated with a One-Electron State in a 
Metal 

We assume that a valence electron in a metal is in a stationary state 
whose Bloch wave function is 
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•AkW = e i *' r u k (V> 

satisfying the Schrodinger equation 

V 2 I h+ ^ M*)- v('W t = o. 

n 


(11.43) 


(11.44) 


We wish to determine the average velocity <r t > of the electron, taken 
throughout the volume of the metal, which, according to equation (11.42), 
is given by 

<*k> = j'>k\'l'k \ 2 & 

= ~z~. f (<Ak V '/'k-'/' / fcVtAfc)dr, ( 11 . 45 ) 

2miJ 

provided < l/ k is normalized, so that 

Jl^kl 2 dr = J|u k | 2 dr = 1 . ( 11 . 46 ) 

Differentiating (11.44) partially with respect to k lt the x-component 
of k, we obtain 


+ *.+(,_ v)^ 

dk t h 2 L dk, K J dkj 


= 0. 


But, from (11.43), we have 


^ = ix«K + e ik ''^, 
dki dk x 


so that 


(11.47) 


(11.48) 


v2 # l = 2i ^ xy2 

dk, dx ' dkj 

Equation (11.47) thus becomes 



using (11.44). Multiplying by \j/*, and integrating throughout 
of the metal, gives 


(11.49) 


(11.50) 
the volume 
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i 



f dx 


dr = — 


m de 
h* dk[ ’ 


(11.51) 


the contribution from the last term of (11.50) being zero for the following 
reason. According to Green’s theorem [Appendix 1, equation (A1.23)], 



(11.52) 


The latter integral is over the surface of the metal, and vanishes owing 
to the periodicity of the integrand. Hence, 




r 

dkj 

dkj 


dr 



2m . ' 


<dr = 0. 


(11.53) 


since t J/* also satisfies equation (11.44). 

There are two equations similar to (11.51), corresponding to the other 
two components of k, and all three equations may be combined to give 


where 


i f ^V^dr = - ~ grad* e(k) = - ^ V k e(fc), (11.54) 
J h n 


„ d d 

V k = ei — 4" &2 — 
dk t dk 2 


+ e 3 


_d_ 

dk 3 ' 


The energy is real, and so we must also have 

01 - 55 ) 


Substituting (11.54) and (11.55) in (11.45), we obtain, finally, 

<».> = l M*). (11.56) 

n 

This is the relation we used in our discussion of metallic conduction 
in § 8.12. It shows, among other things, that a perfect lattice has no resistance. 
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for an electron in a state k will always have the same average velocity. 
In the case of a free electron, with V(r ) = 0, we have 


and equation (11.56) gives 


e(k) = 


ft 2 fc 2 

2m 



m 


(11.57) 


(11.58) 


Equation (11.42) tells us, in fact, that v k is constant, with the above value. 

As a second example, let us consider the tight binding method applied 
to a simple cubic lattice. Equation (8.116) gives the energy of the state 
k as 

e(k ) = s 0 —a —2y (cos k t a+cos k 2 a+cos k 3 a), 

so that 


V fc e(k) = 2ya(e t sin k t a+e 2 sin k 2 a+e 3 sin k 3 a). (11.59) 


The x-component <n x ) of the average velocity, for example, depends upon 
k l as shown in Figure 11.1. It is zero not only at the centre of the zone, 
but also at the zone boundaries k t = ±n/a. 



Fig. 11.1. The x-component of the average velocity as a function 
of k it according to the tight binding theory for a simple cubic lattice. 


More sophisticated models give similar results. When the energy surfaces 
intersect the zone faces orthogonally, the component of <r fc > normal to 
a zone face must be zero over the whole face. 


11.5. Wave Packets 

Let us consider a free particle moving along the x-axis. The time-dependent 
Schrodinger equation is 


h 2 d 2{ P dY 
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with the particular solution 

W k (x, t) = e i(kx_a>,) > (11.61) 

a plane wave with wavelength 2n/k and angular frequency 

= = ( 11 . 62 ) 

h 2m 

The energy e(k) may have any value, so that the general solution of (11.60) is 

W(x, t) = f°° A(k)e i(kx - mt) dk. (11.63) 

J — 00 

This is known as a wave packet. A(k) is an arbitrary function (it must, of 
course, be such that the integral will converge), but the most useful wave 
packets are those for which A{k) is large for only a small range of values of k. 
As a specific example, let us suppose that A(Jc) is the Gaussian function 

A{k) = cz-° Hk - K) \ (11.64) 

where c, a and K are constants. The value of A(k) is a maximum when 




(c) 

Fig. 11.2. Graphs of the functions A(k), \W(x, 0)| 2 , and SiW^x, 0), 
tjie real part of W(x, 0). 


k = K, and falls to 1/e times this value when k—K= ±1 la, tending 
rapidly to zero for larger values of \k—K\, as shown in Figure 11.2 (a). 
The range of k which contributes significantly to the integral (11.63) is 
thus roughly of width 2/a and symmetrical about K. 
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Substituting (11.64) in (11.63), we obtain 

«P(x, 0 = c f °° e l-°Hk-K)>+ilkx-wV 2 m)n) dk ' 
J — 00 

Now let us write t, = k—K, so that 

2 E 2 • /, hk 2 \ 

— a £ + i /ex-n 

\ 2m / 

= -a 2 Z + ix(K + Z)-—(K + Z) 2 

2m 

= -ae-2K-y, 


where 


2 i ht 

a - a --, 

2m 


0 i / htK\ 

f -2\ - x+ v)’ 


and 


Then 


= iK (- 


X + 


htK \ 
2m) ' 


^(x, t) = c\ e ai2 2fii Mi; 


= ce (p2/a) " y I P 


= ce (p2/a) ~ y 


j: 

- c] Ji 


XP 2 l a ) ~~ y 


-V 71 


2 iht\ 

a H- 


exp 


2m) 


x — 


htKV 


m 


4 („*+*!) 

\ 2m/ 


\ 2m/ 


In particular, 


Cv/rc - 

—_ 1 _ p 


1>(x,0) = -X— e -;v2/4a2 e ijcK , 


(11.65) 


( 11 . 66 ) 


. (11.67) 




( 11 . 68 ) 












Ch. 11, § 5] 


WAVE PACKETS 


323 


the real part of which is shown in Figure 11.2 (c). The factor exp (i xK) 
is a plane wave profile, with wavelength 2n[K, and its amplitude is modulated 
by the factor exp (— x 2 /4a 2 ). 

The probability density at t = 0 is 

\V(x, 0)1 2 = ^ e ~x 2 / 2 a 2 (11.69) 

a 

This is the Gaussian function shown in Figure 11.2(b), with its maximum 
at x = 0. It falls to 1/e times its maximum value when x = ±a^j2. Nearly 
the whole of the area under the curve (c must be chosen so that this is unity) 
comes from the range —a^j2 < x < aj2. We may therefore say that, 
at t = 0, a particle in the state ¥ will almost certainly be found within 
a distance a^j2 of the origin. We may therefore take 

Ax = a^2 (11.70) 

to be the uncertainty in the position of the particle at t = 0. 

Now equation (11.39) tells us that the momentum of a free particle in a 
stationary state k is p = mv = hk. The state F is not a stationary state, 
however, and the k values are spread over an infinite range, but the most 
important k values lie within 1/a of K. We may thus take the uncertainty 
Ak in the value of k to be 1/a, and the uncertainty in the momentum of the 
particle to be 

Ap = f>Ak = h/a. (11.71) 

From (11.70) and (11.71), we have 

Ax-Ap = h^/2, (11.72) 

at t = 0, in agreement with the uncertainty principle of Heisenberg, which 
states that 

Ax ■ Ap ^ \h (1T73) 

(in obtaining this expression, in § 2.10, we defined Ax and Ap as the root- 
mean-square values of the deviations from the mean, which are smaller 
than the values used above). 

It follows from (11.67) {hat 



(11.74) 
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At any time t the maximum of this function occurs at x = htK/m, so that 
the maximum probability density, or the centre of the wave packet, is 
moving in the positive xc-direction with velocity 


v 


g 


hK 

m 


(11.75) 


This is known as the group velocity of the packet. We also note that, as t 
increases, the curve of l^l 2 against xc flattens out and broadens gradually, 
but remains Gaussian, with the same total area. 

Since co = hk 2 /2m, we have 


v 


g 



(11.76) 


We will now show that this formula for the group velocity remains true 
whatever the dependence of co upon k. 

Let us first consider a single plane wave 

e i(kx-wt) (n. 77 ) 

This function has value unity when kx = cot. The wave profile (see § 1.3) 
thus travels with velocity 

u = cofk, (11.78) 

which is called the phase velocity or wave velocity. 

Now let us add to (11.77) another plane wave, specified by k + dk, 
oo+5(o. We obtain 


^i[(fc + <5fc)x-(co-f <5co)f] 

|* e “±H*<5fc-f<5o) + Q Yi(xdx-tdo)-j Q i[(k + ±dk)x-((o + ±d(Q)t-\ 

= 2 cos $(x5k-t5(o)e m+iSk)x - (( ° +id<o)t \ (11.79) 

If an d <5co are small, this is virtually the wave (11.77) with its amplitude 
modulated by the function 2 cos (%xdk—t 5co). The real parts of the two 
wave profiles and their sum are shown in Figure 11.3. Formula (11.79) 
hus represents a series of wave groups whose maximum amplitude moves 
in the positive xc-direction with the group velocity 
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If co is a function of k, then in the limit as <5 k tends to zero, we may write 


v 


s 


dco 
dk ' 


(11.81) 


If we form a wave packet, as in (11.63), from plane waves wnose k values 
all lie close to a given value K, then the same argument will apply to any 
pair of waves, and the group velocity will be given by (11.76). 

We note that the phase velocity of the waves in (11.63) is 


a) hk 
k 2 m 


(11.82) 


which for the essential k values is nearly enough hKjlm, that is to say, 
only one half of the group velocity. 




Fig. 11.3. Two plane wave profiles with slightly differing wavelengths, 

and their sum. 


The classical velocity of the particle must be identified with the group 
velocity of a packet, and not the phase velocity of a single wave. The mean 
wavelength is 

A = - = — , (U.83) 

K mv g 

from (11.75), which agrees with the de Broglie formula (1.5) only if the 
particle velocity is taken to be v g . 
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Finally, we may use the concept of group velocity to obtain another 
uncertainty relation of Heisenberg, which we have not yet considered. 
The uncertainty At in the time at which a particle associated with a wave 
packet passes a given point on the x-axis must equal the uncertainty in x 
divided by the group velocity; thus, 


Ax _ hAx 
v s de/d k ’ 


(11.84) 


s. a> = e/h. However, the uncertainty As in the energy of the particle 
is related to the uncertainty Ak by 


Ae = — Ak, 
d k 

assuming Ak to be small, so that 


(11.85) 


At • Ae = hAx • Ak = Ax • Ap. 
Hence, from (11.73), we have 

At ■ Ae ^ \h. 


( 11 . 86 ) 

(11.87) 


11.6. Equation of Motion of a Wave Packet 


We have so far been talking about wave packets which represent the 
motion of a free particle. Now let us consider the more general case of a 
particle moving under the action of a field. We will, for simplicity, still 
restrict ourselves to motion along the x-axis, the potential energy of the 
particle due to the field being F(x). The time-dependent Schrodinger 
equation is now 


ft 2 a 2 y 

2m fix 2 


+ V(x)'F = ih 


dV 

17 


( 11 . 88 ) 


A particular solution is 


^(x, f) = <A*(x)e ie(fc)(/fi , (11.89) 

where \[/ k is an eigenfunction, and e(k) the corresponding eigenvalue, of 

■ t U + = s *- «>- 9 °) 


The general solution may be written 
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n*,0 = L^ k (*)e" i,<fc)t/ft > (U- 91 ) 

ft 

the coefficients A k being constants. If the energy spectrum is continuous, 
or if the discrete energy levels are sufficiently closely spaced, the sum in 
(11.91) may be replaced by an integral: 

^(x, 0=|A(/c)^ fc (x)e- ie(fc)t/ft d/c. (11.92) 


We will not consider a specific function 'F, but will merely assume that 
this has the form of a wave packet, that is to say, \F\ is virtually zero 
except in a small range of values of x at any time. The average x-coordinate, 
or the centre of the packet, is given by 



x'F*'Fdx. 


(11.93) 


The rate of change of this with time is the group velocity of the packet, 
that is, 


d(x) 
d t 



(11.94) 


We multiply equation (11.88) by xF*, subtract the complex conjugate 
equation multiplied by xF, and integrate over all x, obtaining 


; h^> 
d t 



2m 




d 2 W* 

dx 2 




dx, 

m J. m dx 


(11.95) 


after two integrations by parts, using the fact that W vanishes at infinity. 
Differentiating again with respect to time, we obtain 

m *v>-±r i e ii e JL + ^^)6 X , 

d t 2 i J - a, \ dt dx dxdtl 
and integration by parts of the second term in brackets gives 


d 2 <x> tiT (d'F'd'F dF3F*\ 















328 TIME-DEPENDENCE AND TRANSITION PROBABILITIES [Ch. 11, § 6, 7 


Substituting for dWjdt and dW*jdt from the Schrodinger equation (11.88) 
and its complex conjugate, we obtain 

d 2 <x> h 2 f 00 d 2 Vd'P*\ , 

JYI --— — — - I --- -| --- I dx 

dr 2mJ _ o0 \ dx 2 dx dx 2 dx J 

+ r <., 

J - oo V dx dx J 
= =-<£>■ 

again integrating by parts and remembering that IP and dW/dx vanish at 
infinity. 

We have thus shown that the mean values of the acceleration and the 
force satisfy the classical equation of motion. In the limiting case of a very 
narrow wave packet, when l P*!P is practically a ^-function, the uncertainty 
in .v will be negligible, and <*> may be identified with the classical value 
of the coordinate of the particle. In this case also the right hand side of 
(11.97) may be identified with the classical value of the force on the particle 
at position x, so that a very narrow wave packet satisfies the classical 
equation of motion. 

The extension of this work to three dimensions is straightforward. 

11.7. Wave Packets from Bloch Functions 

In § 11.4 we determined the mean value of the velocity'of an electron 
in a Bloch state k. This may be obtained in a different way, by constructing 
a wave packet from Bloch functions with wave vectors in the vicinity of k, 
and determining the group velocity of this packet. It is worth considering 
this method in detail, as it leads quite easily to an expression for the ac¬ 
celeration of a conduction electron in an external field. 

Multiplying \J/ k in (11.43) by the appropriate time factor, we obtain the 
time-dependent Bloch function 

•P k (r, t) = u k (r)Q ik r e~ U(k)t/ \ ( 11 . 98 ) 

Let us construct a wave packet from such functions, with wave vectors 
close to k\ that is, we write 

V(', l) = J ■ -«■«•!d*". (], , 99) 
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where \A(k')\ has a sharp maximum at k' = k and falls rapidly to zero 
on every side. The integration is throughout the region in which A(k') 
is appreciably different from zero. 

Expanding e(fc') in Taylor’s series about k, we obtain 

e(fc') = s(k) + (k'-k) • V fc e(fc) + . . .. (11.100) 

Since \k' — k\ is very small in the region of integration, we may neglect the 
remaining terms in the series, and, as a good approximation, write 

*F(r, t) = A{k')u k '{r)Q l< ' r ~ tVk c/ft) ‘ (k-k) dfc'. 

Also u k -(r) may be assumed to change very little with k’ in the region of 
integration, and so may be set equal to u k (r ) and taken outside the integral. 
We then have 

«P(r, t) = <F fc 0, 0 J A(fc> i(r " tV '‘ E/fi) ’ (k-k) dfc', (11.101) 

where x F k is the Bloch function given, in (11.98). 

The integral in (11.101) has constant value so long as r — tV k s/h is constant. 
Thus, if we ignore the periodic variation of u k (r ), which is independent 
of time, we may say that points at which | *F\ 2 has a given value move with 
the constant velocity 

i> = jV k e(fc), (11.102) 

n 

which is the group velocity of the packet. Comparison with equation 
(11.56) shows that v is equal to the average' velocity of an electron in the 
stationary state k. It must be remembered that, owing to the concentration 
of the function A(k') about k, the wave packet must extend throughout 
several Wigner-Seitz cells, but it may still be small compared with the 
dimensions of the metal as a whole. 

11.8. Acceleration of the Electrons in a Metal due to an External Electric 
Field 

Now let us consider the effect of an electrostatic field £ applied to a metal. 
The mean wave vector k of a wave packet representing an electron will 
now change with time; that is to say, the electron will be accelerated. We 
may assume, however, that the electron energy e(fc) remains within the same 
zone — the fields which can be applied to metals are so weak that there is 
little probability of their causing an electron to make a transition to a 
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higher zone, although such a transition is not entirely ruled out. We also 
assume that (11.102) remains true in the presence of an applied field, and 
make use of the fact proved in § 11.6, namely, that a wave packet moves 
like a classical particle. 

The work done by the field on the electron per unit time is, then, 


-eE-v= ~ ' Vfte(fc), 


(11.103) 


-eE - 


and this must be equated to the rate of change of the electron energy, 
which is 

d t d t w 

We therefore find 

dfc = _ e £ 
d t~ h 



0 A 

Fig. 11.4. Energy curve for 
a Brillouin zone of a one¬ 
dimensional metal. It is as¬ 
sumed that there are finite 
energy gaps at the zone bound¬ 
aries A and A'. A point re¬ 
presenting an electron moves 
in the direction shown by the 
arrows under the action of an 
electric field E in the negative 
^-direction. 


for the rate of change of the mean wave 
vector k of a wave packet due to the field. 

The wave vector increases uniformly with 
time in the direction of the applied force, 
so that the point which represents an electron 
in k -space will travel with constant velocity, 
until it eventually reaches a zone face. We 
assume that there is an energy gap here 
which it is unlikely that the electron will 
jump. Hence, the representative point must reappear at the corresponding 
position on the opposite face of the zone, and continue to move with the 
same velocity. In the one-dimensional case shown in Figure 11.4, if — eE 
is in the positive ^-direction, the representative point will travel along the 
energy curve until it reaches A, and will then reappear at A' and again 
travel along the same curve to A. In extended fc-space (see Figure 8.4) 
the point would continue to move in the positive ^-direction along the oscil¬ 
lating energy curve of the same zone. 

The acceleration of a wave packet is 

d» 

~dt 


= --V k e(k)= - 

h d t h \d tj 




(11.106) 
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using (11.103). The rectangular Cartesian components of the acceleration 
are, thus, 

a2„ a 2„ v 


II 

-rl •*- 

-oH 

"Pi 

(d 2 e 

(dkl 1 

dv 2 

- -1 

( d 2 a 

dt 

h 2 ' 

wfc 2 d/c, 

dyj _ 

--I 

( d 2 e 

dt 

h 2 ' 

{ .dfc 3 dfc, 


Ei + 


d 2 a 


E 2 + 


*i E \ 

ski El ) ’ 


dk$dlc2 oft .3 
and these equations may be written compactly in the form 

d 2 e 


d v t _ _ e y 
d ~t~ h 2 j dkjdkj 


Ej, ij = 1,2,3. 


(11.107) 


The classical equation of motion of an electron in the field £ is 

(11.108) 


d» _ e ^ 
dt ~ 


m 


Now, if the one-electron energy is given by 


e(fc) = 


2 1,2 


h k 


h 2 


2m* 2m* 


(kl + kl + kl). 


(11.109) 


with m* constant, as is very nearly the case in sodium, for example, equation 
(11.106) reduces to 


d» _ e 
dt m* 


( 11 . 110 ) 


which is the classical equation (11.108) with the effective mass m* substituted 
for m. In a slightly more general case, if 

.2 ■ 


= T (< + + I 

equations (11.107) reduce to 

i = 1, 2, 3, 


dth _e_ F 

j , * >' ’ 

dt m t 


( 11 . 111 ) 


( 11 . 112 ) 


and m*, m* are called the effective masses for motion in the x-, y- and 
z-directions respectively. 
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In accordance with the foregoing, equation (11.106) is often written in 
the form 


dt> 

dr 



(11.113) 


where (1 jm*) is the reciprocal mass tensor, having 
specified by 


'J_\ _ d 2 s 
m*jij h 2 dkidkj 


the nine 


components 


(11.114) 


— the rule for multiplying the tensor by the vector is simply that contained 
in equations (11.107). 


The acceleration is generally not in the direction of the force — eE, and 
may, in fact, be opposed to it. For example, in the case of one-dimensional 
motion, we have 


dy eE d 2 e 

(11.115) 

dr ~ h 2 dk 2 ’ 

and the effective mass is the scalar 

* t2 / d 2 e 

/ dk 2 

(11.116) 


If the graph of e(k) is of the type shown in Figure 11.4, we see that d 2 e/dk 2 
is negative near the zone boundaries, so that here the electron has negative 
effective mass, and the acceleration is in the opposite sense to the force — eE. 

If, in addition to the electric field E, there is also an applied magnetic 
field H, a natural generalization of the foregoing results is to substitute 
for the force — eE the Lorentz force 

— e + - v a H j 

In other words, we replace equation (11.105) by 

(11.117) 

where v is again given by (11.102). Detailed consideration shows that this 
equation is correct for most purposes. However, according to classical 
mechanics, the orbit of an electron in a combined electric and magnetic 
field is a helix. Since the work of the present section is based upon the 


d k el 1 
— = - - £+ - vaH 
dr ft \ c 
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representation of an electron by a wave packet, the dimensions of which 
cannot be smaller than those of a Wigner-Seitz cell, it is expected that 
equation (11.117) will fail when the radius of the helix becomes comparable 
with the atomic radius, that is, for sufficiently strong, magnetic fields. 

Finally, let us briefly consider the conductivity of a metal. We have al¬ 
ready noted that a perfect crystal has no resistance, for an electroh in a 
given stationary state always has the same average velocity. Hence, an 
electric current, which is due to an asymmetrical distribution of occupied 
states in k-space, will continue unchanged until the distribution is changed 
— that is to say, until some electrons have made transitions to new states. 
Although, under the action of an applied electric field £, the current may at 
first increase with time, according to (11.106), it will not increase indefini¬ 
tely, owing to the effect of the zone structure. However, the steady state 
required by Ohm’s Law will never be achieved. 

This would be true, of course, only for perfect crystals. Real crystals 
are never perfect. In the first place, even at very low temperatures, the ions 
are in a state of thermal vibration about their mean positions, which 
destroys the exact periodicity of the lattice. Secondly, there are generally 
impurities present, that is, dissolved atoms of foreign elements — it is 
hardly possible to get rid of these entirely, although they may be reduced 
to very small traces. Thirdly, there are generally flaws in the crystal structure 
itself, due to some local disarrangement of the ionic patterns — examples 
of such flaws are dislocations, stacking faults, and grain boundaries. 

The departures from perfect periodicity of the lattice field may be regarded 
as perturbations which cause some electrons to make transitions to different 
states — we S ay that the electrons have collisions with, or are scattered 
by, the lattice irregularities — and this is the reason for the electrical 
resistance of the metal. Only some electrons can make such transitions, 
namely, those near the Fermi surface. Owing to the fact that the electrons 
are so much lighter than the ions, their energy changes very little in collisions 
with the lattice, and hence electrons in states well below the Fermi surface 
cannot be scattered, because there are no vacant states of nearby energy to 
which they can transfer. The scattering is thus limited by the Pauli principle. 

When an electric field is applied, the whole Fermi distribution begins to 
move, according to (11.105), but this movement is opposed by the scattering, 
which tends to restore the original symmetrical distribution. Very soon a 
state of equilibrium is reached, when the Fermi distribution becomes static 
but displaced from its original position, and the result is a steady current. 
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The calculation of the probability of scattering per unit time, or transition 
probability, of an electron from a state k to a state k' by a lattice irregularity 
is thus the central problem of the theory of conductivity. It is also an 
extremely difficult problem, for which only approximate solutions have 
been obtained. It would take us too far afield to attempt to give an adequate 
account of this theory here, and it is not necessary to do so, since several 
excellent books and review articles are available (see Bibliography), which 
cover the subject exhaustively. We will, however, give a brief description 
of the general method of calculating transition probabilities, in order to 
prepare the reader for the more advanced treatises. 

11.9. Transition Probabilities 

Although we shall, throughout the present section, refer to the electrons 
in a metal, subject to a perturbation due to lattice irregularities, it must 
be understood that the method we shall use, known as time-dependent 
perturbation theory, is quite general, and may be applied to any perturbed 
system — a free atom in the presence of a fight wave, for example. 

Let us write the time-dependent Schrodinger equation of an electron in 
a metal in the form 


(H+V)V = ih—, 

8t 


(11.118) 


where H is the Hamiltonian of an electron in the perfectly periodic lattice, 
and U(r, t) is the difference between the potential energy in the actual 
lattice and the potential energy V(r ) in the perfect lattice. We shall assume 
that U(r, t ) is a small perturbation. 

As usual, we let (/^(r) and s(k) denote the eigenfunctions and corresponding 
eigenvalues of H, so that 



and assume that the i j/ k are normalized in the volume of the metal, that is, 



( 11 . 120 ) 


the integration being throughout the volume of the metal. According to 
the discussion at the end of § 11.1, we may then write the general solution 
of equation (11.118) in the form 




Ch. 11, § 9] 


TRANSITION PROBABILITIES 


335 


*(r, 0 = 1 A k {t)^ k {r)z-'™ tlh . 

k 


Substituting this into (11.118), we obtain 

X^W k +^ k )e" i£(k)f/fi = Z 


dA k 


A k e(k)ilJ k + ih-—\l/ h 
d t 


( 11 . 121 ) 


e -ie(fc) t /« (H. 12 2) 


Owing to equation (11.119), the first term in the summand on the left 
cancels that on the right, and we are left with 

I = i* E ^ 

k k at 


We multiply this equation through by {j/ k ., and integrate throughout the 
volume of the metal. Remembering that the i \i k are orthogonal as well as 
normalized, we find 


dA k' — A Ur 

• t, ' - 1 ^ k'k 9 

(11.123) 

at in k 


where 


Ukk = J 'l't-UiJ/ k dr. 

(11.124) 

Now we will suppose that at time t = 0 the electron is 
so that, W(r, t ) being normalized, 

in the state k it 

, / A x / 1, k = k t , 

|0, k^k t . 

(11.125) 


Furthermore, provided the interval t is not too long, we may assume that 
the coefficients A k (t) in equation (11.123) retain their values at / = 0. 
We are then neglecting the products A k U Wk , with k # k ti which are 
second-order terms, since U k , k is also small. Thus, equation (11.123) becomes 


which gives, 


dA k > 
d t 


= - k u n 

in 


i[e(fc')-e(fci)]!/« 



U k , ki e mk,) ~ eik,)V,A dt 


(11.126) 


for k' ^ k h Extending the argument of § 11.2 to non-conservative systems, 
we interpret \A k .(t)\ 2 as the probability that at time t the electron will be in 
the state k' — in other words, it is the transition probability from the state 
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ftj to the state k' in time t. We shall in future drop the subscript i on k t . 
and take the initial state to be the state k. 

The result (11.126) is true for any small, time-dependent perturbation 
U(r, t). In treating the scattering due to lattice vibrations, however, it is 
usual, in order to simplify the working, to calculate the transition probability 
for a fixed displacement of the ions, and then to average over all possible 
displacements at a given temperature. Let us therefore proceed on the 
assumption that U is independent of time (the result will, of course, be the 
same if the perturbation is switched on at t = 0 and off after time t but is 
otherwise constant). U kk is then also independent of time, and (11.126) 
becomes (with k instead of ft,-) 


M>) = 




k'k 


e(k') — e(k) 


(l_ e ! [e(k')-e(k)]t/f,j 


(11.127) 


The probability that, starting in the state k, the electron will be found 
in the state k' after an interval t is, thus, 


where 


tl col k 


(11.128) 


"ft'* = 0( fe ')- e ( fc )]/6- 


(11.129) 


For large values of t [which may still be small enough to justify the 
assumption leading to equation (11.126)] \A k '(t)\ 2 has a sharp maximum 
when a> k , k = 0 — in other words, transitions in which energy is conserved 
are most likely. For such transitions we see that \A k .(f)\ 2 is proportional 
to t 2 , for 


lim 

" ft'* - * 0 


sin 2 {a> k , k t/ 2) 


CO 


2 

kk 



(11.130) 


This is somewhat surprising, as we should have expected the transition 
probability to increase linearly with time. However, it must be remembered 
that the distribution of one-electron states in fc-space is practically a con¬ 
tinuum, so that there is a large number of final states with approximately 
the same wave vector k' and the same energy. What is of physical signifi¬ 
cance is thus the probability of a transition from the state k to one or another 
of a group of final states with approximately the same wave vector k', and 
with energy almost the same as that of the initial state (these may he, for 
example, near a particular element of the Fermi surface): we will now 
show that this is proportional to t. 
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For convenience we denote the energy c(k') of a final state by e'. We 
also denote by ^(e') the density of the final states in energy, so that 
^(E')de' is the number of final states having energies in the small interval 
de' about e'. The probability that in time t an electron initially in the state 
k will make a transition to one or another of this group of final states is then 


4 

ft 1 


J 


|2 sin 2 (oW/2) 

\ u k’k\ $ 

o> k 'k 


^(e')d£\ 


(11.131) 


The integration is over the energy range of the final states considered, but 
most of the contribution to it comes from a narrow interval about e' = e(k), 
or co k . k = 0. Since we are considering a group of states with almost the 
same wave vector k', we may therefore take | U k > k \ 2 outside the in- 


tegral sign, and write (11.131) as 


w ^wf* sin2( r ,/2) d«v* 

(11.132) 

h J- oo copk 


(little error is introduced by taking the integration over 

an infinite range, 

rather than over a finite range including co k k = 0). The integral is 

t f 00 sin 2 x ^ nt 

l)-* x 2 X 2 ’ 

(11.133) 

so that (11.132) becomes 


^\U k ' k \ 2 ^(s)t, 

(11.134) 


which is proportional to t, as required. The transition probability per unit 
time from the state k to one or another of a group of final states with wave 
vectors near k\ and with almost the same energy as the initial state, is thus 

-HV.I 2 •'*&)• (1U35) 

h 

We will leave our discussion of transition probabilities here. For the 
methods of calculating the matrix element U Wk the reader is referred to 
the more advanced works mentioned in the Bibliography. 





Appendix 1 


VECTOR ANALYSIS 



The components ( A x , A y , A z ) of a vector A, with respect to some system 
of rectangular Cartesian coordinates, are the projections of the vector on 
the axes. The length, or magnitude, of the vector A, denoted by A or \A\, 
is given by 

A — \liAx +A y +A 2 ), (Al.l) 

and is always taken to be positive. 

If e l9 e 2 , e 3 are unit vectors, that is, vectors of length unity, in the direc¬ 
tions of the x, y, and z axes respectively, then the vector A may be written 

A = 4 x e 1 + A y e 2 + A z e 3 > (A1.2) 

In most text-books these unit vectors are denoted by i, j, k, but we have 
avoided these symbols as they are rather overworked in the theory of metals. 

The radius vector or position vector of a point P(x, y, z ), with respect 
to the origin O, is the vector 

r = xe l +ye 2 +ze 3 . (A1.3) 

The distance of the point from O is, thus, 

OP = r = yJ{x 2 +y 2 -\-z 2 ). 
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(A1.4) 
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The scalar product A • B of two vectors A and B, inclined at an angle 6 


to each other, 

is defined as 



A ■ B = AB cos 9. 

(A1.5) 

Hence, 

e x ' e t = e 2 ■ e 2 = e 3 • e 3 = 1, 

(A1.6) 

and 

e y ■ e 2 = e 2 ■ e 3 = e 3 ■ e 3 = 0. 

(A 1.7) 

In terms of the components of A and B, it follows that 



A B = A x B x + A y B y + A z B z . 

(A 1.8) 

In particular, 

A- A = A 2 = A x +A y +A z , 

(A 1-9) 


in agreement with (A 1.1). 

The vector product A a B is defined as the vector C whose magnitude 

C = AB sin 9, (A1.10) 

where 6 is the smaller of the angles of inclination, and whose direction is 
perpendicular to the plane containing A and B, in the sense defined by the 
right-handed screw rule with respect to a rotation of A towards B through 
the angle 6. Hence, 

e y a e x = e 2 a e 2 = e 3 a e 3 = 0, (Al.ll) 

and 

a e 2 = c 3 , e 2 a e 3 = e lt e 3 a ey = e 2 ■ (A 1 . 1 2) 

Also, 

A A B = -BaA. (A1.13) 


It follows that the components of C are 

C x = A y B z -A z B y , ) 

C y = A Z B X -A X B Z , (A1.14) 

C z = A x B y — A y B x , j 

or, alternatively, the vector' C may be written in the determinantal form 
(see Appendix 3) 

' e t e 2 e 3 
A x A y A z 

B x By B z 


C = A a B = 


i 


(A1.15) 
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The scalar triple product F A a B is found from (A 1.15) to be 


F-AaB = 


F x Fy F z 
A x Ay A, 
B x B y B z 


(A 1.16) 


and we deduce (as may be proved geometrically, using the definition of 
A a B) that F • A a B is numerically equal to the volume of a parallelepiped 
with F, A, and B as concurrent edges. 

The vector operator V (del) is defined as 


^ d d d 
V = e t - he 2 - \-e 3 — 

dx dy dz 


(A1.17) 

The gradient of a scalar point function, or scalar field, \j/(r) is defined as 

(A1.18) 


grad ij/ = V\]y = e t — +e 2 — +e 3 ^ 
dx dy dz 


and it may be shown that, if n is a unit vector, n • Vip is the derivative of i(/ 
in the direction of n, which may be denoted by dij//dn. 

The divergence of a vector point function, or vector field, F(r) is defined as 


divf = V- F = ^ + ^Z + ^£. 

dx dy dz 

If we set F = in equation (A 1.19), we obtain 

V • Vxjj = VV = — + — + ^ 

dx 2 dy 2 dz 2 


The operator 


V 2 


d 2 d 2 d 

in + n — 

dx dy 2 


dz 2 


(A1.19) 


(A 1.20) 


(A 1.21) 


is called the Laplacian operator + 

Green's theorem, in its most useful form, states that, if \j/{r) and <j>(r) 
are scalar point functions, 

JjiAV 2 </)-0V 2 ^)dr = J (i/Afi/) - (frVi]/) • ndS, (A1.22) 

the integral on the left being taken throughout the volume v within the 


t The symbol A is often used in place of V 2 


to represent the Laplacian operator. 
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closed surface S, that on the right being a surface integral over S, and n 
being a unit vector along the outward normal to 5 1 at the position of the 
element dS. If d/dn denotes differentiation in the direction of n, (A1.22) 
may be written 


(>V 2 <£-<£VV)d 


r 




(A1.23) 


Finally, the compactness of vector notation is demonstrated in its ap¬ 
plication to Taylor’s series for a function of three variables, which is used , 
in Chapter 9. Thus, the Taylor’s series 


a — +b— +c-)f(x,y,-z ) 
dx dy ozl 

+ — (a—+b—+c—\ f(x,y,z) + ... (A1.24) 

2! \ dx dy dz) 

becomes 

f(r + l) = /(**) + ! • V/(r)+ i(Z • V) 2 /«+. . (A1.25) 


where 


l = ae 1 + be 2 + ce 3 . 



Appendix 2 


EXPANSIONS IN SERIES OF ORTHOGONAL FUNCTIONS 


A set of functions iAi(x), i }/ 2 (x), i j/ 3 (x ),... is orthonormal , that is, or¬ 
thogonal and normalized, in the interval {a, b ), if 

f </'m(*)«/'„(*)dx = <5 mn (A2.1) 

a 

for all m and n — S mn having the value unity if m = n and zero if m # n. 
It is said to be a complete set if there is no function, not identically zero, 
which is orthogonal to all the functions ip n — in other words, no function 
<fi(x) exists such that 

f \l/*(x)<f>(x)dx = 0, for all m, 


except <f>(x ) = 0. In this case it is possible to expand an arbitrary function 
f(x), in the interval ( a , b), as an infinite series of the form 

f(x) = a l ip 1 (x)+a 2 ip 2 (x) + a 3 ij/ 3 (x) + . . . 

= Y J a n 'l'n{x). (M2) 

n 


The constant coefficients a n , which may be complex numbers, are easily 
determined by multiplying both sides of (A2.2) by \j/*(x) and integrating 
over (a, b). This gives 



and, according to (A2.1), all the terms on the right hand side vanish except 
that with n = m, and for this term the value of the integral is unity. We 
thus find 



(A2.3) 


which determines the coefficients uniquely. If the functions \J/ n are orthogonal 
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but not normalized, then the right hand side of (A2.3) must be divided by 

f l«AmO)| 2 dx. 

J a 

In order that such an expansion be possible, that is to say, in order 
that the series (A2.2) may converge to f(x) in {a, b ), the function f(x) must, 
in fact, satisfy certain analytical conditions. These are very wide, however, 
and certainly less stringent than the conditions which a wave function 
must satisfy, so we will not consider them explicitly. 

It may be noticed that the foregoing is merely a generalization of the 
familiar method of expanding a function in Fourier series. The functions 

1 l! 1 . 

—= , - - cos nx, — sin nx, 

V2 n \Jn y/n 


where n = 1, 2, 3,..., constitute a complete orthonormal set in the interval 
(—n,n), and the expansion 

Cl l 00 1 00 

f(x) = + -= £ a n cos nx+ — £ b n sin nx, 

y/2n yjn n=i v 7i n = 1 


where 



1 f" 

b n = — \ f(x) sin nxdx, 

V 71" -n 

as given by (A2.3), is just the Fourier series representing/(x) in the interval 
(~ 7 i, 7 i). It is evident from this example that not all the functions of the set 
need appear in the expansion; that is, some of the coefficients may be 
zero. For instance, if we expand the function x, which is an odd function, 
in Fourier series, the coefficients a 0 and all the a„ are found to be zero, 
so that the series consists of sine terms only — it is, in fact 

x = £ -(-l) n + 1 sin nx. 
n= 1 n 

This series represents a saw-toothed function, with period 2n, which is 
equal to x in the interval ( — n, n). 
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The set of all the wave functions of a system having only discrete energy 
levels is a complete orthogonal set of the type we have been considering, 
and these functions may, of course, be normalized. Examples of such 
systems are the linear harmonic oscillator and the particle in a box. In the 
latter case, the normalized wave functions are found in Chapter 1 to be 



n = 1,2, 3, 


and these functions constitute a complete orthonormal set in the interval 
(0, L). We may therefore expand a function f(x ) in a series of the form 


/(■ 


:*) = t 


2 22, . nnx 

- 2, a n sin — , 
L n= 1 L 


valid in (0, L), where 


a » = )![ /(*) si 

LJ o 


. nnx , 

sin-dx. 

L 


If periodic boundary conditions, as defined in § 7.3, are applied to this 
system, the resulting normalized wave functions have the form 




1 


2n\nx/L 

e 5 


u = 0, ± 1, +2,. . ., 


which again constitute a complete orthonormal set in the interval (0, L). 
Since both positive and negative values of n occur, the expansion ©fa function 
J{x) in (0, L) is now 


where 


/(*) = - TP I a n e 2 ™ x/L , 

L/ n~ — oo 


a 


n 



2n\nxjL^ 


A slight complication occurs when, in addition to a discrete series of 
eigenvalues, a system also has a continuous distribution, or continuous 
spectrum, of eigenvalues. Corresponding to the latter there is a continuous 
range of eigenfunctions which must be included in the expansion of an 
arbitrary function. This may be done by adding on to the sum (A2.2) an 
integral over the continuous range of eigenfunctions. We will omit explicit 
details, however, as the formal procedure we have described is very little 
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altered. This may be understood from the fact that an integral is just the 
limit of a sum — we have seen in Chapter 7 that, even when a discrete set 
of eigenfunctions exists, provided their eigenvalues are sufficiently closely 
spaced, it is often convenient to replace a sum over them by an integral, 
and this process may be reversed. 

When some of the eigenfunctions are degenerate, it must be ensured that 
all the independent functions are included in the expansion, and these 
must be orthogonalized, if necessary, by the method described in § 2.7.2. 

Of course, although we have only referred to functions of a single variable, 
everything we have said applies also to orthogonal sets of functions of any 
number of variables — the wave functions of a particle in three dimensions, 
for example, or those of a many-particle system. It is only necessary that 
the integration in equation (A2.3) be taken over the whole configuration 
space of the system. 


Appendix 3 


DETERMINANTS AND SIMULTANEOUS EQUATIONS 


A determinant is generally denoted by a square array of symbols, called 
elements, enclosed between vertical lines, the order of the determinant being 
the number of rows (or columns) of elements. The elements may be numbers 
or functions, and the determinant is equal to a sum of products of elements, 
called the expansion of the determinant, which may be obtained by the 
application of fixed rules. For example, the expansion of a second-order 
determinant is given by 


d 2 

hi b 2 


Cl i b 2 — ^2^1’ 


The expansion of a third-order determinant is given by 


a l a 2 a 3 
^1 ^2 ^3 

Ci C 2 C 3 




*1 

C 1 



b 2 

C 2 


l 


a 1 b 2 c 2 a ib 3 c 2 — a 2 b 1 c 3 + a 2 b 3 c 1 + a 3 b l c 2 — a 3 b 2 c 1 . (A3.1) 

We note that each term is of the type ajjfy, where (ijk) is a permutation 
of the numbers (123), and all possible permutations are included. It may be 
seen also that the permutation (132) can be obtained from (123) by a single 
interchange of the numbers 2 and 3; the permutation (213) is also obtained 
by a single interchange, of 1 and 2; and the permutation (321) may be 
obtained by three interchanges, for example, 1 and 2, then 1 and 3, then 
2 and 3. These permutations, requiring an odd number of interchanges, 
are called odd permutations — although they may be obtained from (123) 
by different successions of interchanges, the number will always be found 
to be odd. All the other permutations may be obtained by an even number 
of interchanges, and are called even permutations — for example, (231) is 
obtained from (123) by interchanging 1 and 2 and then 1 and 3. Now, the 
terms for which (ijk) is an odd permutation of (123) have negative sign, 
and the others have positive sign. Thus, if we introduce a permutation 
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operator P to act upon the subscripts, such that P a t b 2 c 3 = a 2 b 2 c l , for 
example, and let p be the number of interchanges making up this permutation, 
then we may write (A3.1) concisely as 

X(-l) p PM 2 c 3 , 

? 

the sum being over all six permutation operators, including the identity 
operator which leaves (123) unchanged. 

If the elements of the determinant are functions, as in Chapter 5, we have 

i) 4>i(x 2 ) <M* 3 ) 

4> 2 (x i) <t> 2 (x 2 ) 4> 2 ( x 3) 

4 > 3(^1) <^3(^2) <£ 3 (* 3 ) 

= <M*i) 4> 2 (x 2 ) 4>3 (x 3 ) - 4> i(*i) <^(* 3 ) <M* 2 ) 

- <M* 2 ) 0 2 (*i) <£ 3 (* 3 ) + 0 i(* 2 ) ^2(^3) <M*i) 

+ (f> \(x s') 4> 2 (.X l) 4* 3(.X 2 ) 4 > l(. x 3')4 > 2 (. x 2 ) ( t > 3( x l) 

= £(-l) P P0l(x 1 )</> 2 (X2)0 3 (^3)- 

P 

It may easily be verified that here the operator P may act upon either the 
subscripts of the 0, or the subscripts of the x t . A determinant of any order 
may be expanded in the same way, so that 

E (-l) p P<M*i)0 2 (* 2 ) • • • <£«(*«) 

P 

is an nth order determinant, the sum being over the n\ permutations of the 
subscripts (123 . . . ri) of either the (j>{ or the x*, p again being the number 
of interchanges in the permutation P. 

It follows that, if two rows (or two columns) of the determinant are 
interchanged, the sign of the determinant is changed, for in order to obtain 
the same permutations of the subscripts p must then in every ease be in¬ 
creased by 1, multiplying each term of the original expansion by -1. It 
also follows that, if the corresponding elements in two rows (or two columns) 
are the same, the determinant is zero, for an interchange of these rows (or 
columns) cannot alter the determinant, yet we have just seen that such an 
interchange must change the sign — if the determinant is unaltered by 
changing its sign, it must be zero Other properties of determinants may be 
obtained in the same way, but these are the essential ones required in the 
text. 

An application of determinants, which we have used in Chapter 3, is 
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to the solution of simultaneous linear equations. For example, let us con¬ 
sider the pair of linear equations 

a l x+a 2 y = 0, 
b 1 x+b 2 y = 0, 

in the two unknowns x and y, the coefficients being constants. Such equations 
are said to be homogeneous, because all the terms are of the same type — 
in particular, there is no term not containing x or y. One solution is 
x = y = 0, but this is generally of no physical significance. However, this 
is the only solution, unless 

a l _ a 2 

b i b 2 ’ 

that is, unless 


! &i 


a- 


= 0 . 


If this is so, the equations have a non-zero solution and are said to be 
consistent. It should be noticed, however, that they cannot be solved for 
x and y separately, but only for the ratio x/y; thus, 


_ __ _ a 2 _ _ u 2 

y a i b i' 

As a slightly less trivial example, let us consider the three homogeneous 
equations 

a t x+a 2 y + a 3 z = 0, 
b ^x b 2 y -\-b 3 z = 0, 

C\X + c 2 y + c 3 z = 0, 

in the three unknowns x, y, and z. One solution is, again, x = y = z = 0, 
but we want the condition for the existence of a non-zero solution. By 
straightforward solution of the last two equations for the ratios x/z and 
yjz, we find 


X 

-y 


z 

1 b 2 b 3 

! b l b 3 


b i b 2 

C 2 c 3 

1 \ C 1 C 3 


c 1 c 2 

in the first equation gives 


b 2 b 3 

C 2 C 3 

hi b 3 \ 
~a 2 1 3 I 

1 c i c 3 j 

. n 1 b l b 2 1 
+ a 3 1 

1 C 1 C 2 \ 


= 0 , 










A3] 


DETERMINANTS 


349 


or 


u i 

bi 

Cl 


a 2 

b 2 


C 2 


«3 


c 3 


= 0, 


which is the condition, for consistency. Similarly, in the case of n homo¬ 
geneous equations in n unknowns, a non-zero solution exists only if the 
determinant of the coefficients vanishes. Even if consistent, however, the 
equations can only be solved for the ratios of the unknowns. 




Appendix 4 


SOME USEFUL INTEGRALS 


C™ - n\ 

1. x"e "*dx = —— , n > —1, a > 0. 

I ^.n + 1 


a 


2. fY-dx-j/*. 


-dx = - . 

0 x 2 


00 sin x , 7t 


4. An integral of the type 



where p(r) is a spherically symmetrical function, and the integrations are 
over all space, may be evaluated using the fact that it represents twice the 
self-potential energy of a spherically symmetrical charge distribution of 
density p(r). 

Let us consider the work which must be done in building up this charge 
distribution outwards from the origin, and let us assume that a sphere of 
charge of radius r has already been completed. The potential at the surface 
of this sphere is 


- f Anr 2 p(r) dr. 

rJ o 


The work which must be done to bring sufficient charge from infinity to 
form a shell of thickness dr, and density p(r), at the surface of the sphere is, 
therefore, 



and it follows that the self-potential energy of the whole distribution is 

Too /V 

4nr p(r) dr 4nr 2 p(r)dr. 


o Jo 
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The required value of the integral is twice this, namely, 

32k 2 | rp(r)dr f r 2 p(r)dr. 

Jo Jo 

As an example, in § 3.3 it is required to evaluate 

ff— e" 2a(r,+r2) dr 1 d»' 2 . 

JJ r 12 

Here p(r) = e -2ar , so that the value of the integral is 


32k 


! f °°re _2ar dr f r 2 e 
Jo j I 


2 -2olt 


dr 


o 

00 


= 32tt 2 
= 32k 2 

- 5 jL 

~ 8a 5 


r™- [T - (L + _L + -LI 

J n L4a 3 \2a 2a 2 4a 3 / 


-2 ar 


dr 


1 1 


1 6 


1 


1 1 


,4a 3 (2a) 2 2a (4a) 4 2a 2 (4a) 3 4a 3 (4a) 2 J 


If the integrals are not over all space, but are to be taken throughout a 
sphere, centred at the origin, and of radius R, it is only necessary to sub¬ 
stitute R for oo in the upper limit of integration. 
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PROBLEMS 


Problems (Chapter 1) 


1.1. Tne most rapidly moving valence electron in metallic sodium, at 
the absolute zero of temperature, has a kinetic energy of about 3 eV. Show 
that its de Broglie wavelength is roughly 7A. 

[1 eV = 1.6020 x 1CT 12 ergs, h = 6.624 x 10“ 27 erg sec, 
m = 9.1066 x 10" 28 g.] 

1.2. Show that the wave functions of a system are unchanged if the 
potential energy is increased by the same amount everywhere. What happens 
to the energy levels? 

1.3. Equation (1.28) may be solved graphically, for given L .and V 0 , 
by plotting both sides as functions of E and finding the values of E at 
which the two graphs intersect. By considering this graphical method of 
solution, show that, for a particle in a one-dimensionaJ well, there is always 
at least one bound state, no matter how small V 0 may be. 

1.4. Show that the wave functions of the three states of lowest energy 
of the harmonic oscillator, given in equations (1.64), (1.65) and (1.66), are 
normalized and orthogonal. 

1.5. In the problem of the particle in a one-dimensional box, discussed 
in § 1.5, suppose that L is so large that the energy levels (1.29) form practically 
a continuum. Find the number of states with energies less than E, and by 
differentiating this with respect to E, show that the density of states (that 
is, the number of states per unit energy range) at energy E is 



[This is the one-dimensional analogue of the problem discussed in § 7.2.] 
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Problems (Chapter 2) 

2.1. Find the wave functions and energy levels of the particle in a box, 
discussed in § 2.3, if the box has unequal sides a, b, c. 

2.2. The potential energy of a three-dimensional isotropic harmonic 
oscillator is 

F(r) = \mcolr 2 . 

Write down the Schrodinger equation for this system in Cartesian coor¬ 
dinates, and separate it into three ordinary equations. HenCe, using the 
results of § 1.8, find the three lowest energy levels, and state the degeneracy 
of each. 

2.3. Show that the average value of the potential energy of the electron 
in the ground state of the hydrogen atom is —e 2 /a 0 . 

Show also that the most probable distance of the electron from the 
nucleus, in the same state, is a 0 . [Hint: Find the probability of the 
electron being in a thin spherical shell of radius r, and calculate the value 
of r for which this is a maximum, keeping the thickness constant.] 

2.4. Verify the uncertainty relation (2.152) for any state of the particle 
in a one-dimensional box (§ 1.5), and also for the ground state of the har¬ 
monic oscillator (§ 1.8). 

2.5. Show that the operators L x , L y , L z , representing the components 
of the angular momentum of a particle about the origin, satisfy the com¬ 
mutation relations 


[L x , L y \ — ihL z , [L y , L z ] ifiL x , [L Z ,L X \ 


i hL v 


2.6. Show that the functions e i<#> , 1, e ** form an orthogonal set in the 
interval (0, 2n) of (p, and that so do the functions sin </>, 1, cos (p , obtained 
by linear combination of the members of the first set. 

2.7. If the operator S' is a function of q, the coordinate conjugate to the 
momentum p, prove that 

r ci h 8S 
i oq 

2.8. Given three independent degenerate wave functions which are not 
necessarily orthogonal, construct linear combinations of them which are 
both normalized and orthogonal. 
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Problems (Chapter 3) 


3.1. Use the variational method to obtain the energy of the ground state 
of a hydrogen atom, assuming a trial function of the form t~". 

3.2. The potential energy of a perturbed harmonic oscillator is 

V(x ) = ^rncolx 2 + Xx 4 , 

where X is a small constant. Obtain an approximation to the energy of the 
ground state by means of first-order perturbation theory. 

If the perturbing term were lx 3 , show that the first-order correction to 
the ground state energy would vanish, and obtain the first-order correction 
to the energy of the first excited state. 

3.3. The potential energy of an electron is zero within a sphere of radius 
R, centred at the origin, and it is one rydberg outside this sphere. Use the 
variational method to obtain an approximation to the energy of the ground 
state, assuming a trial function of the form e~ ar . 


3.4. A helium-like atom has two electrons and atomic number Z. Write 
down its Hamiltonian in the atomic units defined in § 3.3. Show, by means 
of the variational method, that the best approximation to the ground 
state energy, given by a trial function of the form e -a(ri+r2) , is -2(Z-^g-) 2 
ryd, and that this is obtained when a = Z--Ar. 

Verify that the ground-state energy of the hydrogenic atom which remains 
when one electron is removed completely is -Z 2 , and hence that the first 
ionization energy (the energy required to remove one electron) of the 
helium-like atom is approximately Z 2 -|Z+^ r ryd. In the case of helium 
compare the value given by this expression with the observed value, 1.80 ryd. 

3.5. The particle in a one-dimensional box, discussed in § 1.5, is sub¬ 
jected to a perturbing field in which the potential energy of the particle 
is Xx, where X is a small constant (for example, the particle might be an 
electron, and the applied field a uniform electrostatic one). Show that 
the first-order perturbation correction to the energy of the state \p n is XL/2, 
and that the second-order correction is 


2r4 


32m XL 


0 -") 


J**n 


the sum being over all positive integral values of j except n. 
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Problems (Chapter 4) 


4.1. Using the order of filling one-electron states given in § 4.3, write 
down the normal configurations of A1 (13 electrons,) Fe (26 electrons), 
Rb (37 electrons), Cd (48 electrons) and Pb (82 electrons), in each case 
noting which shells are closed. 

4.2. Calculate the first ionization energies of He, Li, Be and B, in their 
ground states, on the assumption that electronic interaction may be neglected, 
and compare your results with the experimental values of 24.5, 5.4, 9.3 
and 8.3 eV respectively. 

4.3. Show that the total charge cloud of the electrons in a closed L shell 
is spherically symmetrical. 


4.4. Show that E in equation (4.11) is given by 

E= £<:,-£ f IWr,)| ! ^(r,)dr, + £ IT- «r 1 )| 2 l* J ('j)l 2 <M<> 

t=i i = i J rij 

when the functions iare normalized. 


4.5. Write down the Hartree equation, in the atomic units defined in 
§ 3.3, for an electron in the ground state of a helium-like atom with two 
electrons and atomic number Z. Obtain a first approximation to the eigen¬ 
value e of this equation by assuming the one-electron wave functions to 
have the form exp [~(Z--^)r]. Compare -s with the approximation to 
the first ionization energy of this atom obtained in Problem 3.4, and note 
that the two are very nearly the same when Z is large. 

[—£ = Z 2 — |Z+xre, approximately.] 

4.6. Show that the one-electron wave functions obtained by the Hartree 
method for an atom are not orthogonal. 


Problems (Chapter 5) 

5.1. An electron moves in a central electrostatic field. When a magnetic 
field is switched on in the positive z-direction the Schrodinger equation is 
equation (5.9). Treat the term due to the magnetic field as a perturbation 
and assume that the unperturbed state is a p state with degenerate wave 
functions 

R(r) sin 6 sin (j), R(r) cos 9, R(r) sin 0 cos </>. 
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Set up the secular equation and solve it, showing that the perturbed energy 
levels are given by equation (5.11) with m t = —1, 0, 1. 

5.2. Write down the determinantal wave function for the ground state 
of a system consisting of two helium atoms, according to the Heitler- 
London method. Assume that two of the electrons occupy the Is atomic 
orbital t j/ A , centred on nucleus A, with opposite spins, and two the Is atomic 
orbital ij/ B , centred on nucleus B. By adding and subtracting rows of the 
determinant (which leaves it unchanged) show that it is equivalent to a 
determinant formed from the functions ip A ±ij/ s , with appropriate spin 
factors. Deduce that the Heitler-London and LCAO approximations 
lead to the same results for this system. 

5.3. Prove, for two-electron and three-electron systems, that the only 
antisymmetric wave function of the form (5.22), is the determinant (5.24). 

5.4. Consider the normalized determinantal wave function 

<Ai(*i) • • • </>i(%) 

* • ? 

0jv( x i) • • • (Pn^n) 
the normalized, orthogonal functions being given by' 

4>i(x) = t/'.O'M0, for i = 1 to n, 

and 

= «Ai(»W), for i = n +1 to N. 

If F is a symmetric operator, independent of spin, prove that 

J <P*F <P dr' = j F* F F dr, 

where 

*An + l(r n + i) . . . ^n+i(r N ) 

• ) 

•AiV^n+l) • • • 

the first determinant being formed from orbitals associated with spin a 

and the second from orbitals associated with spin /?, but spin being otherwise 
ignored. 

Prove also that F is normalized. 


F = 


V nl(N — n)l 


^i( r i) • • • <Ai (r N ) 
•A-Ox) • • • iA n (0 
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Problems (Chapter 6) 

6.1. Construct the Hamiltonian operator, similar to (6.1), for an alloy 
consisting of N t atoms with atomic number Z x and N 2 atoms with atomic 
number Z 2 . If the atoms are monovalent and divalent respectively, and the 
ion-cores are small, obtain a modified Hamiltonian, similar to (6.4). 

6.2. Verify that for the ground state of a helium atom, in which the two 
electrons occupy the same orbital, with opposite spins, the Fock equation 
(6.22) reduces to the Hartree equation (4.18). What is the exchange energy 
in this case? 

6.3. The energy of a metal corresponding to the determinantal wave 
function <P of Problem 5.4 is given by equation (6.18). Show that the same 
energy is obtained from the formula J W*HW dx, where W is the product 
of determinants also given in Problem 5.4. 


Problems (Chapter 7) 

7.1. Obtain the relation between pressure and volume in a free-electron 
gas at 0°K, using the Hartree theory [see equation (9.87)]. How is this 
modified when the Hartree-Fock theory is used? 

7.2. A free-electron gas would be ferromagnetic if all the electrons had 
the same' spin, so that in the ground state at 0°K all the orbitals of lowest 
energy would be occupied by one electron each. Show that, according to 
the Hartree-Fock theory, the average energy per electron would be 

3.51 1.154 


By comparing this with equation (7.54) show that, according to this theory, 
a free-electron gas would be ferromagnetic if the atomic radius were greater 
than about 5.5 B.u. What factor has been omitted which might prevent this? 

7.3. Find the average charge density at distance r from a given electron 
in a ferromagnetic free-electron gas at 0°K, according to the Hartree-Fock 
theory. 
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Problems (Chapter 8) 

8.1. Find the number of states per atom inside a sphere which just 
touches the faces of the first Brillouin zone of the simple cubic, body- 
centred cubic/and face-centred cubic lattices. [Ans.: 1.05, 1.48, 1.36.] 

8.2. Draw free-electron energy contours in the first and second Brillouin 
zones for a two-dimensional square lattice, according to the extended and 
the reduced zone schemes. 

8.3. Compare the free-electron energy at the centres of'the faces and at 
the corner points of the first Brillouin zone for each of the cubic lattices. 

8.4. Show that, in the application of the tight binding method to a 
simple cubic lattice given in § 8.10, one of the energy contours on each 
of the planes k t = 0, k x = n/a is a square. 

8.5. Show that, according to the tight binding approximation using only 
atomic s states, the one-electron energies for the body-centred cubic and 
face-centred cubic lattices are given by 

e(/c) = £ 0 —a —8y cos k x a cos k 2 a cos k 3 a, 

£ (M) = £o -a- 4y (cos k x a cos k 2 a +cos k 2 a cos & 3 a + cos k 3 a cos k t a), 

respectively, where 2 a is the length of the edge of the cubic cell. Show also 
that, near k = 0, the effective mass is h 2 /Sya 2 in both cases. 

8.6. Show that, when periodic boundary conditions are used, the Hartree 
self-consistent field in a metal has the period of the lattice. 

8.7. The potential energy of a particle in a one-dimens'ional lattice 
has the constant value V 0 in regions of width b, which are separated by 
regions of width a in which the potential energy is zero. Solve the Schro- 
dinger equation in two consecutive regions and, assuming that the wave 
functions have the Bloch form, prove that 

cos k(a + b J = cos pa cosh yb+ [(y 2 -p 2 )/2py] sin pa sinh yb, 

where 

P = (2 mE/h 2 ? and y = [2m(V 0 -E)/h 2 ]*. 

[E is the energy of the state with wave number k.] 

If V Q is allowed to approach infinity and b to approach zero in such a 
way that bl 0 remains finite, the limiting value of the expression mabV 0 /h 2 
being P, show that 

cos ka = (P/pa) sin Pa + cos pa. 
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Plotting the right hand side as a function of Pa, and noting that the left 
hand side lies between — 1 and 1 for real values of k, observe that there are 
alternately allowed and forbidden energy regions, the latter becoming 
smaller as the energy increases. [This is the Kronig and Penney model 
of a one-dimensional lattice.] 


Problems (Chapter 9) 


9.1. By analogy with equation (8.85) for the faces of a Brillouin zone, 
obtain the equations of the faces of the atomic polyhedra of the two cubic 
lattices shown in figure 9.1. 

9.2. According to the method of Wigner and Seitz, the Schrodinger 
equation for the lowest electronic state in metallic hydrogen within an 
atomic sphere, using the atomic units defined in § 3.3, is 



subject to the boundary condition 


#o 

dr 


0 at r = r„ 


Find e 0 as a function of 'r s , approximately, assuming ift 0 to be roughly 
given by 

1 P 0 = Q~ r 


[This is not a true solution of the Schrodinger equation, but it satisfies the 
correct boundary condition and tends to e - ' as r s tends to infinity, as the 
correct solution should.] 

Obtain an approximation to the average energy E per electron by adding 
to 6 0 the free-electron Fermi energy. Plot £ as a function of r s , and find 
approximate values of the cohesive energy and equilibrium atomic radius. 


9.3. It can be shown that for a metal of valency Z the eigenvalue of 
equation (9.10) depends upon r s approximately according to the equation 

e 0 (r s ) - Z \ ~ — —) , (atomic units) 

\r s rj 

for values of r s near the equilibrium atomic radius, A being a constant. 
Use this equation in conjunction with equation (9.92), taking a to be 
unity, to calculate A for metallic magnesium, given that the equilibrium 
atomic radius is 3.3 B.u. Hence find an approximate value for the com¬ 
pressibility of this metal. 
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Problems (Chapter 10) 

10.1. Using published tables of atomic radii or atomic volumes, calculate 
fico p for beryllium, magnesium, aluminium and copper. [Ans.: 19, 11, 
16, 11 eV.] 

10.2. Calculate the band widths of sodium, magnesium and aluminium, 
according to equation (10.65), with /? given by (10.91), and compare them 
with the calculated and experimental values given in Table 1 of § 7.9. 

10.3. Verify equation (10.76) for (H s t ) 0J . 

10.4. Plot the graph of the correlation energy given by equation (10.93), 
for r s between 1 B.u. and 6 B.u., and compare it with that of the Wigner 
formula (9.82). 

10.5. An electron with an energy of several thousand electron volts and 
momentum p penetrates a thin metallic film. In so doing it excites a single 
plasmon, losing energy heo and being scattered through an angle 9. If hk 
is the momentum absorbed by the plasmon, show that, to a good approxi¬ 
mation, 

p6 — hk. 

Using this expression in the dispersion relation (10.3), obtain the ap¬ 
proximate equation 

2 

hco = hco p +y — 9 2 , 
m 

where y = 3 C 0 /5hco p> assuming a free-electron gas in the metal. [This 
relation between energy loss and scattering angle has afforded an experimen¬ 
tal test of the plasma theory.] 

If the energy of the incident electron is 25 keV, show that for aluminium, 
with hco p = 16 eV, the maximum scattering angle, assuming k c to be given 
by (10.91), is about 10“ 2 radians. 

Problems (Chapter 11) 

11.1. Extend the work of § 11.6 to three dimensions. 

11.2. The wave function of the electron in a hydrogen atom in th> state 
n = 2, / = 1, m, = 1 is 


\j/ = Are r,a sin 0e ,<#> , 
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where a = 2h 2 /me 2 and A is a constant. Using equation (11.34) for the 
current density, show that the magnetic field at the nucleus due to the 
orbital motion is e 1 m 2 l2Ah 5 c gauss. [The field due to a charge — e moving 
with velocity v, at a point whose position vector with respect to the charge 
is r, is erAv/cr 3 .] 

11.3. For the wave packet given in equation (11.67), show that the 
maximum probability density decreases to one half its value at t = 0 
when the packet has travelled a distance Ana 2 3/A, where X is the mean 
wavelength. 

11.4. Find the acceleration of an electron in an electric field E when 
s(k) is given by equation (8.116). 

11.5. Assuming the free-electron model of a metal described in Chapter 7, 
calculate the matrix element U k k for the scattering of an electron from 
an initial state k into a final state k' by a field in which the potential energy 
of the electron is 



r 


where A is a positive constant. [The necessary integration may, to a suf¬ 
ficiently good approximation, be taken throughout the whole of space, 
rather than throughout the volume v of the metal.] 

If k lies on the Fermi surface, show that the probability per unit time of 
a transition from k to a region of the Fermi surface lying within an element 
of solid angle dco about the direction of k' is 

8n 2 e 4 N(£ 0 )dco 

/m 2 (A 2 + |k-k'| 2 ) 2 ’ 

where N (( 0 ) is the density of states in energy at the Fermi surface, according 
to the free-electron approximation. 
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